FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACAT TTTGTGAAGT TGTAAAACAGAAAACCTGT TAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
T ACAGTAGCT CCAGAAAAAT GC TTAT T T GGGGCAAT GCTAAATATTGCGGCAGT T TTAT GCAT T G 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
ACAAAGGT TAT GTGC T T CACAT GATCACTACT GCAGCAGAATGGTC TAT G TCAT TT TCCT T CT TT 
GGT t T TT T CC TGAC T TACATT CGT GAT T T T CAGAAAATT TCTT TACGGGT GGAAGCCAAT T T ACA 
T GGAT T AACCC TC TATGACAC TGCACCT T GCCC TAT T AACAAT GAAC GAAC AC GGC T ACT T T CCA 
GAGATAT T TGAT GAAAGGATAAAATAT T T C TGTAATGAT TAT GATTC TC AGGGAT TGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT GAAT GC T G ATAAT C AG G AAAC AT G AAAG AAG C CAT T T G AT AGA T TAT T C T AAAGGAT AT CAT 
CAAGAAGAC TATTAAAAACACC TATGCC TAT AC TT TT TTAT CT CAGAAAATAAAG TCAAAAGAC T 
ATG 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLS FLPSALVI WTSAAFI FS YI TAVTLHHI DPALPYI SDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

y Important features: 

Zl Type II transmembrane domain: 

|r| amino acids 13-33 

Ml 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 



|f| N-myristoylation sites. 



5 ;: 



K 5 



amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 

GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 

CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 

GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 

GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 

AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 

TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 

CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 

CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 

TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 

CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 

CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

ATAT GCTACCAAGGAGAGAATAGAATACT CCAAAACCTTCAAGGGGAAATAT TTTAATTTT CTT GGTT 

ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

GAAGTTT T GGTCCCAACACAT TT CCTT CATTCTTGT T GGAATAAT CATCGTCACATCCATCAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 

ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 

GAGAAGCAAATGGCACCT TGAACT TAAGCCTACTACAGACTGT TAGAGGCCAGTGGT TT CAAAAT TTA 

GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 

ATTTT TCACCT TCATAGCATACT CCTT CCCCGTCAGGTGATACTATGACCAT GAGTAGCATCAGCCAG 

AACAT GAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CCCGTGTGGATAT GAGGCTGG 

TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 

CTAT GGTAGCTGAGCCAAACACGTAGGAT TTCCGTTT TAAGGT T CACAT GGAAAAGGTTATAGCT TTG 

CCTT GAGATTGACTCAT TAAAAT CAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 

ACT C T AGAG T C GACC T GCAGAAGC T T GG CCGCCATG GC C C AAC T T GT T TAT T G CAGC TT ATAAT G 



FIGURE A 
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MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTIMALLSGFGAWCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMI I SKKKRMZ\MARRTMFQKGE VHNKPSGFWGMI KSVTT S AS GS ENLT L I QQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VLLIRMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALS S I LFLYLAHKQAPEKQMAP 

Important features: 



gl Signal peptide: 



amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin -binding proteins 

amino acids 151-160 
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FIGURE 5 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GT CATGCAGAATTAT GGGGAT CACCC T T GTGAGCAAAAAGGCGAACCAGCAGC TGAATT T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTATCACCACCAAAGATCCCATAT TCAACACTCAAACT GCAACACAAACAACAGAATTTAT TGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 

C TGC TCCAGCT T CCAC TT CT AT TCCACGGAGAAAAAAAT T GAT T TGT GTCACAGAAGT T T T TAT G 
GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAAT GATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAAT GATAGCAACCCT AA 
T GAGGAAT CAAAGAAAACT GATAAAAACCCAGAAGAGT CCAAGAGT C CAAGCAAAAC TACCGTGC 
GATGCCT GGAAGC TGAAGT T TAGAT GAGACAGAAAT GAGGAGACACACCTGAGGC TGGT T TC T T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTT CCTAAC T GGAATCAGCT CAGGAC T GCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCT GAAAGAGAAACACGTATCCCACCTGACATGTCCTTCT GAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGCCCCT GAAAGCCAT GGAGATTCT CATAAC T TGAGACCTAATCT CT GT AAA 
GC TAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACAC TGT CAGCAGGGACT GTAAAC 
ACAGACAGGGT CAAAGT GT T T TC T C TGAACACAT TGAGT TGGAAT CACT GT T TAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 

AAAAT AAA&AC T C T TAT AAAT T T C TAT T T T TAT C T GAG T T AC AGAAAT GAT T AC TAAGGAAG AT T 
ACTCAGTAATTTGTTTAAAAAGTAATAAAATTCAACAAACATTTGCTGAATAGCTACTATATGTC 
AAGT GCTGT GCAAGGTATT ACAC TC TGTAAT TGAATAT TAT T CC T CAAAAAAT TGCACATAGTAG 
AACGC TAT CT GGGAAGC TAT t TT TT T CAGT T T TGATAT TTCTAGC T TAT CTACTT CCAAACTAAT 
T T T TAT T T T TGC TGAGACT AATC T TAT T CAT TT T C TC TAAT ATGGCAACCAT TATAACC T TAAT T 
TAT TAT TAACATACCT AAGAAGTACAT TGT T ACC T CTAT AT ACCAAAGCACAT T T TAAAAGT GC C 
AT TAACAAAT GTAT CACT AGCCC T CCT TT T T CCAACAAGAAGGGAC TGAGAGATGCAGAAAT AT T 
TGTGACAAAAAAT TAAAGCAT TTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LS LAGKDQVE TALKAS FE TCS YGWVGDG FWI S RI S PNPKCGKNGVGVL I WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 



Signal sequence: 

H amino acids 1-16 

U Transmembrane domain: 

I A amino acids 235-254 



TP?' 

ru 



N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site, 
amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 

GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 

CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 

CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 

GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 

GGAGGACACGCAGCACAAAT T GCGCAGCGCGG T GGAAGAGATGGAGGCAGAAGAAGC TGCT GCTA 

AAGCATCAT CAGAAG T GAACC T GGCAAACT TACC TCCCAGCTATCACAAT GAGAC CAACACAGAC 

ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 

TGGACAAATGGTCT T TTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 

ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 

ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 

GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 

ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 

ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 

CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

GAGATCCT GCTGCCCAGAGAGGTCCCCGATGAGTATGAAGT TGGCAGCT TCATGGAGGAGGT GCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 

CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 

GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 

TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

T GAT TGGT T T TGGGGAAAT GTGGAGAAGAGT GCCCTGCTTT GCAAACAT CAACC T GGCAAAAAT G 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 

CAAGTCACACAGCTAGT GAAGACC AGAGCAGT T T CAT CT GGT TGT GAC TCTAAGC TCAGT GCT CT 

C TCCACTACCCCACACCAGCCT TGGT GCCACCAAAAGT GC T CCCCAAAAGGAAGGAGAAT GGGAT 

T T T TCTTGAGGCATGCACAT C T GGAAT T AAGGTCAAACTAAT TC TCACATCC C TC TAAAAGTAAA 

CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 

ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 

TACAGGTTAACCTGCAGAAACAGTACTTAGGTAAT TGTAGGGCGAGGATTATAAATGAAATT TGC 

AAAATCAC T TAGCAGCAAC TGAAGACAAT TATCAACCACGT GGAGAAAATCAAAC CGAGCAGGGC 

TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 

TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 

CATGAT TGT ATAAGCAT GCT TT CT T T GAGT T T TAAAT TATG TATAAACATAAGT TGCAT T TAGAA 

AT CAAGCATAAATCAC T T CAAC TGCAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RS AVEEMEAEEAAAKAS SEVNLANLPPS YHNETNTDTKVGNNT IHVHRE I HKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 



Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 



Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 



N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 



APP ID=10063563 



Page 160 of 320 



FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCTGCC TCT GGT G T CT GGAAAAAT T TAT CAAG TT CC TAAACCGCAATGCATACAT CATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
AC T T T AAGAGCCCCCACC T CAAC TAT T AC T GGC TGCCCAT CAT GACC T CCAT CCT GGGGGCC TAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT T CT G GGCAAGAAGAACGAGGCGCCCC CGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAA.GGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGC CAC T GCAC TCCAACC TGGGTGACAGACT CT GT CT CCAAZ^ACAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGAT AT T T TGT TAACTC 



FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLWSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NOTLALGQCVLAGAFAS FYWAFHKPQDI PT FPL I S AFI RTLRYHTGSLAFGAL I LTLVQ I ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site, 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 

TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 

CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 

TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 

GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 

CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 

ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTGTACACCATG 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCT GATGCAGACCGAGGAGT GCCCACCTAT GCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 

CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 

CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 

ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 

AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 

TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 

GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKLPOTCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 

ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 

QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 

VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 

SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWW 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAZ\AGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAAT GGAGGCAT TGAGAACACATTAGAGAAGGAGGT GAT GCAG TATGAC T AC TATTCT T CAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGAC T CCT GATAGT T CAGGAT GC T T CAGAGAGGGCAGCAC TTATACCT GGT GGTC T T T CT GATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACT TTTAGAACT ATGAG TACTACTT TTGT TAAATGTGAAAAACCCTCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
T C AG T GAG AC T GAGC C T GAT G T G T T AACAAAT AGG T G AAG AAAG TCTTGTGCTG TAT T C C T AAT C 
AAAAGAC T TAAT AT AT T GAAGT AACAC T T T T T T AGTAAGCAAGATACC T T T T TAT T TCAAT T CAC 
AGAATGGAAT TTTTTTGTTT CATG T C T CAGAT T TAT T T TGTAT TTCTTTTT T AACAC TCTACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T TGCAAAAGGGGAAGAAAGGAAT TGCGAATACATGTAAAATGT CACCAGACAT T TGTAT TAT TT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCAT TCATATCATGT TTCCTTTGCGT TCAGCCAATTT CAATTAAAATGAA 
CTAAATTAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQmARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNWGGIENTLEKEVMQYDYYSSYFDIFLIAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFIIAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myri s toyl a ti on si tes . 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
1^ CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
111 GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
;*? TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
U CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
0! TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
|s| TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
ifj TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
JlT CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
ffl CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
UJ TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
, CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
f» ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
CJ AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
fll GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
5!! GCGAGGAGGCGTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
W CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAflAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTC T CAG T CC CAACC TC GGGGATGT GTGCAGACAGGGCT GTGT GACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGT GAC TC TAG TCT T GGCCCCAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGAT GC 
T T TAGGAACATGT TTTGCTTTTT TAAAATAT ATATAT T TATAAGAGAT CCT TTCCCAT T TAT T C T 
GGGAAGAT GT T T T TCAAACT CAGAGACAAGGAC TTTGGTTTT TGTAAGACAAACGATGATAT GAA 
GGCC TT TT GTAAGAAAAAAT AAAAGAT GAAG TG TGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIAALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
T VT QLRPNAT YS VCVMPLGPGRVPEGEEACGEAHT P PAVHSNHAPVT QARE GNLPLL IAPALAAV 
LLAAIJ^VGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATGC G 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCC T TAGATTCCAAGAC TAC T T TGACAT CAGATGAGTCAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGT TGCTGGTCAAATATTTCT T GAT TCAGAAGAATCTGA 
AT TAGAAT CCT CTAT T CAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTG T CACAG 
AAGAT AT CAGC T T TC TAGAGTC T CCAAAT CCAGAAAACAAGGAC TAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
?™ TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
"if C TACAACCTAT GACTACAAAGCAGAT GAAAAGTGGGGC TT T TG T GAAACT GAAGAAGAGGCT GCT 

f|| AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
||| CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
%I f CCAAAGCGC TGGAGAGAG T GTCATATGCT CT TT TAT T T GGT GAT TAC T T GCCACAGAATATC CAG 

— GCAGCGAGAGAGATG TT TGAGAAGCT GACT GAGGAAGGCT C TCCCAAGGGACAGAC TGCTCT T GG 

w 

CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
Ifl AATAAT AT TAACATCAGAAGAAT T TGTGGT T TATAGCGGCCACAAC T T TT T CAGC T TT CAT GATC 

tl CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
:; AACACATGGAATC TACAT GTAAAT GAAAGT T GGTGGAGTCCACAAT T T T TC T T TAAAAT GAT TAG 

o 

TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

III 

GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA 
AAATT T T GTAAAT GGT GGCTATAGAAAAACAACATGAAAT AT TATACAATAT T T TGCAACAAT GC 
CC TAAGAAT T GT TAAAAT T CAT GGAGT TAT T TGTGCAGAAT GACTCCAGAGAGC T C TAC T T T C TG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCAT TT TTAATAAAATTATGTCTAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



W 

3 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSKESEL 
ESS I QEEEDSLKSQEGESVTEDI S FLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEl^QTGffilLNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALV Y YT FGALGGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 

AAT TCAGAT T T TAAGCCCAT T C TGCAGT GGAAT T TCAT GAAC TAGCAAGAGGACACCATCT TCT T 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAA ATG CTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTT TTATCACTGGATGTGACTCGGGCT T TGGAAAC TTGGCAGCCAGAACTTT TGAT 
AAAAAGGGAT T TCAT GT AATCGCTGCCT GT CT GAC T GAAT CAGGAT CAACAGC T T TAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
H CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
C CT GT TT GGAC TCATCAGT GT GACAC TAAATAT GC T T CC T T T GG TCAAGAAAGC TCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
| t | TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
Iff ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
10 TCGCCAT TTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGT TACATTGAAAAA 

m AGT CTAGACAAACTGAAAGGCAATAAATCCTAT GT GAACATGGACC TCT C T CCGGTGGTAGAGT G 

m CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

W 

HI TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
Q GCAGAGC T GGC TAAT CCCAAGGCAGTGTGACT CAGCTAACCACAAAT GT C T CC TC CAGGCTATGA 

III AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
!:;! CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLIIMAGVPGVIAPTDWLTLEDY 
REP IEVNLFGL I SVTLNMLPLVKKAQGRVINVS SVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSC I E PGL FKTNLADPVKVIEKKLAI WEQLS PDI KQQYGEGYI EKS LDKLKGNKS YVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 



Important features of the protein: 
M Signal peptide: 

amino acids 1-17 

AIR.!. 

l.ii Transmembrane domain: 

f|| amino acids 136-152 

hi 

N-glycosylation sites. 

L. amino acids 161-163, 187-190 and 253-256 



Q Glycosaminoglycan attachment site 

Ill amino acids 39-42 

III 



N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATCGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACAGGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CAT TCAGAAATATAT TCCATGC TAT CAGCT T T T TAGC TT TTATAAT T C T T CAGGCGAAGTAAAT G 
AGCAAGCACTGAAGAAAATAT TATCAAAT GTCAAAAAGAAT GT GGTAGGT T GGTACAAAT TCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
H> ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
P GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
Jj* TTTTAGCCGAGCAGTACAAACACACAGCTCTAAATTTTTTGAAGAAGATGGATCCTTAAAGGAGG 
I I TACATAAGATAAAT GAAAT GTATGCT T CAT T ACAAGAGGAATTAAAGAGTATATGCAAAAAAGTG 

Ifl GAAGACAGTGAACAAGCAGTAGATAAACT AGTAAA.GGAT GT AAACAGAT TAAAAC GAGAAAT T GA 

ffl GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
* ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
;L AT GTC TT TAAAAAATAGACATG T T TC TAAAAG TAGC T GTAACTACAAC CACCATCTCGAT GTAGT 

III AGACAAT CTGACC TTAATGGTAGAACACACT GACAT T CCT GAAGCTAGTCCAGC TAGTACACCAC 

g| AAATCAT TAAGCATAAAGCC TTAGAC T T AGATGACAGAT GGCAAT TCAAGAGAT C T CGGTT GTTA 

ill GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
tl GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 

C TAC AT T T TGA. T C C T T T T AAC C T T AC AAGG AGAT T T T T T TAT T T GG C T GAT G GG T AAAG C C AAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACTCTTAGTGCATT TACT TCACAAAGTACTTTT TCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
AC ACAT T CTT TAGAAT TGGAAAAG T GAGACCAGGCACAGT GGC T CACACCT GTAATCCCAGCAC T 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCAT GTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT TT TCAAAATATGGAAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGE S T S AVLS GFVLGALAFQHLNTDS DTE G FLLGE VKGEAKNS I TDS QMDDVE WYT I DI QKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQEMIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE IEKMKGFGEYSRSPT F 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 



APP_ID=1 0063563 



Page 174 



FIGURE 23 



G GCACAGC CGCGCGGC GGAG GGCAGAGTCAGC CGAGCCGAGTC CAGCCGGACGAGC GGAC CAGCGCAG GGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT G GAGGAGAAGTACAG CCT GCAAAGTAAAGAT GATTT CAAAAGT GT G GT CAGCGAACAGTGCAATCAT T T G 
CAAGCT GT CTTTGC TT CACGT TACAAGAAGT TT GAT GAAT T CTT CAAAGAACTACTTGAAAA.T GCAGAGAAATC CCTG 
AAT GATAT GTT TGT GAAGACATAT GGCCATTTATACAT GCAAAATT CT GAGCTAT TTAAA.GAT CT CTT CGTAGAGTTG 
AAACGTTACTACGT GGTG GGAAAT GT GAACCTGGAAGAAAT GCTAAAT GACTT CT GGG CTCGC CTCCT GGAGCGGATG 
T TC CGCCT GGT GAACTCCCAGTAC CACT TT ACAGAT GAGTATC T GGAAT GT GT GAGCAAGTATACGGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTT GCGGGAGAT GT CGT GAGCAAGGTCT C CGT GGTAAACCCCACAGC CCAGT GTACC CAT GCC CT GT T GAAG 
AT GAT CTACT GCT CC CACT GCCGGGGTCT CGTGACT GT GAAGC CAT GT TACAACTACTGCT CAAACAT CAT GAGAGGC 
T GTTT GGC CAACCAAGGGGAT CT C GATTTTGAATGGAACAATTT CATAGAT GC TAT GCT GAT GGT GGCAGAGAGGCTA 
GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
AATAGTGTTCAAGT GT CT CAGAAGGT TTT CCAGGGATGT GGACC CC CCAAGCC CCTCC CAGCT GGACGAAT TT CTCGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACAT^ 

AGTTT GGACCGACT GGTTACT GAT GT CAAGGAGAAACTGAAACAG GCCAAGAAATT CT GGT CCT CCCTT CCGAGCAAC 
GTTT GCAACGATGAGAGGAT GGCTGCAGGAAAC GGCAAT GAGGAT GACT GTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CTGTTTGCAGT GACAGGAAAT GGATTAGCCAACCAGGGC AACAACC CAGAGGTC CAGGTT GACAC CAGCAAACCAGAC 
ATACT GAT C CTTC GT CAAATCAT GGCTCTT CGAGT GAT GAC CAGCAAGAT GAAGAATGCATACAAT GGGAACGACGTG 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TTT GACTACAATGCCACT GACCATGCT GGGAAGAGT GC CAATGAGAAAGC C GACAGTGCT GGT GTC CGT C CTGGGGCA 
CAGGCCTACCTCCT CACT GT CTTCT GCATCTT GTT C CT GGTTATGCAGAGAGAGT GGAGATAATT CTCAAACT CT GAG 
AAAAAGTGTT CAT CAAAAAGTTAAAAGGCAC CAGTTATCACTTT TCTACCATC CTAGTGACTTTGCTTTTTAAAT GAA 
T GGACAACAAT GTACAGT TT TTACTATGT GG CCACTGGTT TAAGAAGT GCTGACTT TGTTT TCTCATTCAGTTTT GGG 
AGGAAAAGGGACT GT GCAT T GAGTT GGTTCCT GCTCCCCCAAAC CAT GTTAAACGT GGCTAACAGT GTAGGT ACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TT TTTTT C CAACT GTGAT CT CGCCTT GT TTCTTACAAGCAAAC CAGGGTCC CTTCT TGGCACGTAACAT GTACGTATT 
T CT GAAATATTAAATAGCT GTACAGAAGCAGGTTTTAT TTATCAT GTTAT CTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALIJ^ELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 

VTVKPCYNYCSNIMRGCIANQGDLDFEWNNFIDAMIJyrVAERLEGPFNIESVMDPIDW 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWS S LPSNVCNDERMAAGNGNEDDCWNGKGKSRYL FAVTGNGLANQGNNPEVQVDTS 

KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

KSANEKADS AGVRPGAQAYLLT VFC I L FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 2* 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCAT CCCCAGGGAT CAGGAGCC TC CAGCAGGGAACC T TCCAT TATATTCT TCAAGCAAC T 
TACAGCTGCACCGACAGTTGCG ATGA AAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAAT T CATGACAG T GT CTGGGC TGCCAAAGAAGCAGT GCCC C TGTGAT CAT T 
T CAAGGGCAATG TGAAGAAAACAAGACAC CAAAGGCACCACAGAAAGC CAAACAAGCATT CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC TAGT GT CAT T TAA.CC T TAAATGCAAT CAGGAAAGTAGCAAACAGAAGT CAATAAATAT T T T T 
AAAT GT C AAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVL I S SLLLLLPLMLMSMVS S S LNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



Important features: 
Signal peptide: 

amino acids 1-22 

|s~ N-myristoylation sites. 

13 amino acids 27-33, 46-52 



■■ " ' 

If] 



fll 



APP_ID=1 0063563 



Page 178 of 320 



FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACC TGGAAACACAT TGGGGAT GGCTGCTGCCT CACCAGAGAGACCTGGAAGGATCT T GA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T TCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTAT CTATCTTCCACTGAGAGGGACCTA 
GCAGAAT GAGAGAAGACAT T CATGTACCACC TACTAGT CCC TCTC TCCCCAACCTC TGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC T CAAAGAGGCAGAGGTT T TGT T CTCAAATATT T T T T AATAAATAGAC GAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATGC TGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC T GGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGC TAGGGGGGCTGCC T TAT T TAAA 
G TGGT TGT T TATGAT TCT T ATACTAAT T TATACAAAGATAT TAAGGCCCT GT TCAT T AAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



H ! Growth factor and cytokines receptors family: 

V amino acids 3-18 

o 
III 
y 

If! 
IP 

y 

Q 
If! 
O 
III 

o 
fIJ 
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FIGURE 31 



GT T T GAAT T CCT TCAAC TATAC CCACAGT CCAAAAGCAGAC T CAC TGT G T CCCAGGC TACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT TTGCATC TGT T TT GATAAAT GATG T TGACAC CC TCCACC GAAT TC TAAGT GGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
C TGC T CGTCT TGATT T TTGTTC TCAGAAAGAGAATAAAAT T GACAGT TGAGCT T T T CCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTC CGAGAATCAT TGTCATGTACAT GCAAAACGCAC TGAAAGAACAGCAGCAT GGTGCAT T GT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAAT GCATATACTACAAC TGC TAT TAAT GGGACAGAT TTCTGTACAT CAGCAAAAGAT GC 
AT T C AAAAT C T T G T CC AAG AAC T CAAG T CAC T T T AC AT C TAT T AAC T GC T T T GGAGAC T T C AT AA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GATGGATCGTCAGAAAAGCCCTAC T T TAT G GAT CAAG AAT T TCTGAGT T T CG TA 
AAAAGGAGCAACAAAT TAAACAATGCAAGGGCACA.GCAGGACAAGCACT CAT TAAGGAATGAGGA 
GGGAACAGAAC TCCAGGCCAT TGTGAGATAGATACCCAT T T AGGTATCTGT ACC T GGAAAACATT 
TCCT T C T AAGAGCCAT T TACAGAATAGAAGATGAGACCACT AGAGAAAAGT TAGT GAAT XT TTTT 
T TAAAAGACCTAATAAACCCTATTCTT CCTCAAAA 
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FIGURE 5*2 

MSGRDT ILGLC I LALALS LAMMFT FRFI TTLLVHI FI SLVI LGLLFVCGVLWWL YYDYTNDLS IE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVXRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLiytAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 
N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 
N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT T TGTATGGGATT TGTGAT GCAGGAAAGC CTAAGGGAAAAAGAAT AT T CAT T CT GT GT GGT 
GAAAAT TT T T TGAAAAAAAAAT TGCCT T C T T CAAACAAGGGTGTCAT TC T GATAT T TATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGC C CAAGT TCACT G T GC CTCAGAT CAAC TGC 
GATG TCAAAGCCGGAAAGATCAT CGAT CCT GAGT TCAT TGT GAAATGT CCAGCAGGAT GCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAG T GGTGTGC T TGATAAT T CAGGAGGGAAAATAC T T GT TC GGAAGG T TGC T GGACAGTCT 
GGTTACAAAGGGAGT TAT T CCAACGGT GT CCAAT CGT T AT CCC TACCACGAT GGAGAGAAT CCT T 
TATCGT CT TAGAAAGTAAACCCAAAAAGGGTGTAACC T ACC CATCAGC T CT TACATAC T CAT CAT 
lm . CGAAAAGT CCAGC T GCCCAAGCAGGT GAGACCACAAAAGCCTATCAGAGGCCACC TAT TCCAGGG 

l2 ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
y CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
Q GCCACAGGAGCCAGGAGAT GGAT CTC TGG T CCAC TGC CACCTACACAAGCAGCCAAAACAGGCCC 

fjl AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
%A GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
Ti T GGGAGAT CCAAACTGCAAAAT T GAC TTGTCGTTTT TAAT T GATGGGAGCACCAGCAT TGGCAAA 

W CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
V« CGGT CCACT GATGGGT GT T GTCCAGTATGGAGACAACC C T GCTAC TCAC TT TAACCTCAAGACAC 

1|| • ACAC GAAT T CT CGAGATCT GAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGAC T TTCTAAT 

1" GTAGGT CGGGCCATC T CCT T TGTGACCAAGAACT TC T T T T CCAAAGCCAAT GGAAACAGAAGCGG 

1^. GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
h! GACT T GCGAGAGAGTCAGGAATCAACAT T T TCT TCATCACCAT TGAAGG T GCTGC TGAAAAT GAG 

40 AAGCAGTAT GT GGTGGAGCCCAACT T TGCAAACAAGGCCGT GT GCAGAACAAACGGCT T C TACTC 

O GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
Hi ACAC TGACCGCC TGGCCT GCAGCAAGACCT GCT T GAAC TC GGC T GACAT T G GC TT CGT CAT CGAC 

*X GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
l;Z AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
RJ TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGT TAAT GAT CC TCAT CACCGACGGGAGGTCC TAC GACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGT CAT TGCCACT CACCCCGCCAGAGACCAC TCCT TC TT TGT GGACGA 
G T T TGACAACC TCCAT CAGTATG TCCCCAGGAT CATCCAGAACATT TGTACAGAGT T CAACT CAC 
AGCC TCGGAAC TGAAT TCAGAGCAGGCAGAGCACCAGCAAGTGC T GCT T TACTAAC T GACGT GT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAAT GTC T T GT TAT TATT CT TT GCCAT CATGCT T T T T CATAT TCCAAAACT T GGAGT TACAAAGA 
TGAT CACAAACG TATAGAAT GAGCCAAAAGGCTACAT CAT GT TGAGGGT GCT GGAGAT T T TACAT 
T T TGACAAT TG T T T TCAAAATAAATGT TC GGAATACAG T GCAGCCC T TACGACAGGCT TAC G TAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGT CAT GACAATG TAGGAAT TGC T GAAT TAAATGTT TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAASAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE <\± 

MRT WLTMKAS VI EMFLVLLVTGVHSNKE TAKKI KRPKFTVPQ INCDVKAGKI I DPE FI VKCPAG 

CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 

ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 

TPTTLPRPSPSAAS TTS I PRPQS VGHRSQEMDLWS TATYTSSQNRPRADPGI QRQDPSGAAFQKP 

VGADVS LGLVPKEELSTQS LE PVS LGDPNCKI DLS FL I DGS TS I GKRRFRI QKQLLADVAQALD I 

GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RSGAP1WVVVMVDGWPTDKVEEASRIARESGINIFFITIEGAAENEKQYVVEPNFANKAVCR 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 

LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 

FKKSKPNKRKLMIL I TDGRS YDDVRI PAMAAHLKGVIT YAI GVAWAAQEELEVI ATHPARDHS FF 

VDE FDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE as 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
AC AGAAAACAAC AAAAAACTT AAGCTT T AAT T TC AT CT GG AATT CC AC AGT T T T C TTAGC TC CCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GG AT GT CACT GAGATCCCTC AAAT GGAGCC TCCT GCT GC T GT CACT C C TGAGTT TC TT T GTGATGT GG TAC 
CTC AGC CTT C CC C ACT AC AATGTG AT AG AACGCGTGAACT GGAT GT ACTT CT AT GAGT AT GAGCCGAT T T A 
C AGACAAGAC TT TCACTT C ACACT TCGAGAGCAT T C AAACTGCTCT C ATC AAAATCCAT T TCTGGT CAT TC 

I J* 

;^ TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 

TGGT GGGGAT AT GAGGTT CTT ACATTTT TC TT AT T AGGCC AAGAGG CT GAAAAGGAAGAC AAAAT GTT GGC 
ff% ATTGTCCTT AGAGGAT GAACAC CT TCTT T ATGGT GACAT AATCCGACAAGATT T TT T AGACACAT ATAAT A 

|g| ACCT GACCT T GAAAAC CATT AT GGC AT T C AGGTGGGTAACTGAGT TT TGCCCC AATGCC AAGT ACG TAAT G 

HI AAGACAGACACT GATGTT T T CATC AAT ACT GGCAAT TT AGTG AAGTAT CT T TTAAACCT AAACC ACTC AG A 

ffl GAAGTT TTT C AC AGGT TAT CC T CT AAT T GAT AAT T ATT CCT AT AGAGGAT T TTACC AAAAAACCC ATATT T 

W CTT ACC AGGAGT ATCC TT TC AAGGTGT T CC CTCC ATACT GCAGT GGGT TGGGTT AT AT AATGTCCAGAGAT 

* TTGGTGCC AAGGATCT AT GAAATGATGG GT CACGTAAAACCC AT CAAGTT T GAAGATGT T TATGT C GGG AT 

Jff CTGT TT GAAT TT ATT AAAAGT GAACAT TCAT AT T CC AGAAGACACAAATCTTTT CT TTCT AT AT AG AATCC 

^ ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
If TGGC AGGTCATGCT AAGGAACACC AC AT GC CAT TAT TAACTT CAC AT T CT AC AAAAAGCC TAGAAGGAC AG 

p| GATACC TTGT GGAAAGTGTT AAAT AAAGT AGGTACT GT GGAAAATTCATGGGGAGGTC AG TGTGCT GGCT T 

jfg*g AC AC TGAACTGAAACT CATGAAAAACCCAGACTGGAGACTGGAGGGT T ACACTT GT GATT TATT AGTCAGG 

CCCT TC AAAGATGAT ATGTGGAGGAATT AAATAT AAAG GAATTGGAGGTT T TTGCT AAAGAAATTAAT AGG 
ACCAAACAAT TT GG AC AT GTCAT T CT GT AG ACT AGAAT TTCT TAAAAGGGTGT T AC TGAGTT AT AAGCTC A 
CT AGGCTGT AAAAACAAAACAAT GTAGAGT TT T AT T TATTGAAC AATGTAGTCACT TGAAGGT TTT GT GT A 
T ATC TT ATGT GGAT TACC AAT T TAAAAAT ATAT GT AGT T CTGTGT CAAAAAACT TCT TC ACT GAAGTT AT A 
CT GAAC AAAAT T TT AC CTGT TTT T GGTC AT TT AT AAAGT ACT TC AAGATGT TGCAGT ATT TCACAGTT AT T 
AT TATT T AAAAT TACT TC AACT T T GT GT TT TT AAAT GT TTTGAC GAT T TC AATACAAG AT AAAAAGGAT AG 
TGAATC AT T C TT T ACATGCAAACATT T TCC AGTT ACTT AACT GATCAGTT T ATT AT TGAT AC AT CACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AGAGAAGAATT AAAGCAAG AAAATC TG AAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYWIERWWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 




Important features: 

Type II transmembrane domain: 



amino acids 20-39 



N -glycosylate on sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 




Glycosaminoglycan attachment site. 



Ly-6 / u-PAR domain proteins. 



amino acids 239-243 



amino acids 23-37 



N-myristoylation site. 



amino acids 271-277 
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FIGURE 32 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGTAC ACAGTCAT TAATGAAGCC TGCCC TGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATATGATC AGATTGAGTGCGTC TGCCC CGGAAAGAGGGAAGTCGTGGGT TAT ACCATC C C TTGC TGC AGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
TAAACCTGGGTT TGT CATCCAAC TAAGAT TTGTCATGT TGAGTC TGGAGTT T GACTACATGTGCC AGTATGACTATGTTGAG 
GT TCGTGATGGAGAC AACCGC GATGGCC AGAT CAT C AAGC GTGTC TGTGGCAACGAGCGGCCAGC TCCT ATC CAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 

ATGC TCC TC ATCCCC TT GT TT CCAT GACGGCACGT GC GTC CTT GACAAGGC TGGAT C T T ACAAGTGTGCCT GC TTGGCAGGC 

f*% 

TAT AC TGGGCAGCGC TGTGAAAATC TCC T TGAAGAAAGAAAC TGC TCAGAC C CTGGGGGC C CAGTCAATGGGTAC C AGAAAA 
Is! TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 

II i T CTT AGTGGCAAT GAGAAAAGAAC TTGC CAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATC TGCATAAAAGCCTGCCGA 

5 ;: 

III GAACCAAAGATT TCAGACC TGGTGAGAAGGAGAGT T C T TC CGATGCAGGTT CAGTCAAGGGAGACACCATTAC AC CAGC TAT 
y I ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
^ I ATACCAAC ATCTGCATACCCAGC TCCAGT AT GAGT GC ATC TCACCCT TC TACCGCCG CC TGGGC AGC AGCAGGAGGACAT GT 

C TGAGGAC TGGGAAGTGGAGTGGGCGGGCAC CATC C TGCAT CCC TATC TGC GGGAAAAT TGAGAACATCAC T GCTCCAAAGA 
% C CCAAGGGTTGCGCT GGCC GT GGCAGGC AGCC ATCTACAGGAGGACC AGCGGGGTGCATGACGGC AGCC TACACAAGGGAGC 

GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
If! GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC 
PI AGAGCCTACAGAT TTC TGC TAT CATTC TGCATCCC AACTATGACCCCATCC T GC TTGAT GC TGACATCGCC ATCCTGAAGCT 

If IJ CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
f| T CCCACATCACT GTGGCTGGC TGGAAT GT CC TGGCAGACGTGAGGAGC C C TGGCTTCAAGAACGACACACTGC GC TC TGGGG 

fgj) T GGTCAGTGTGGT GGACTCGCTGC TGTGT GAGGAGC AGCATGAGGAC CAT GGCATCCCAGTGAGT GTCACTGATAAC ATGTT 

C TG TGCCAGCTGGGAACCCAC TGCCCC T TCT GATATC TGC AC TGCAGAGACAGGAGGCATCGC GGCTGTGT CC TT CCCGGGA 
CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 
C TGCC T TCACCAAGGT GCTGCCT T TTAAAGACT GGAT TGAAAGAAAT ATGAA AT GAA CC AT GC TC ATGCAC T C CT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGC T TCTGAC TT CAGGGACAAAAC TC AGT GAAGGGTGAGTAGACC TCCAT TGCTGGT AGGC TGATGCC GC GTCCA 
C TACTAGGACAGCCAAT TGGAAGATGCC AGGGC TT GCAAGAAGTAAG TT TCT TCAAAGAAGACCATATACAAAAC C T C TCCA 
C TC CACT GACC TGGTGGTC T T CCCCAAC T TTCAGT TATAC GAATGCCATCAGC TT GACCAGGGAAGATC TGGGCT TCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP IQS I GS SLHVLFHS DGSKNFDGFHAI YEE I TAC S S S PCFHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALP FGDLPMGYQHLHTQLQYEC I S PFYRRLGS SRRTCLRTGKWS GRAPSCI PICGKIENI TAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVVVAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QS LQ I SAI I LHPNYDP ILLDAD I AI LKLLDKAR I S TRVQP I CLAAS R 
DLS TS FQESHI TVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGI PVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 30 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TT T GGGCC CTT GTAG C T GACAGAAGGT GGCCAGGGAGAAT GCAGCACACT GCTCGGAGA ATGA AGGC GCTT CT GT T GC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GC CTCC C ACTAC GGC CT GACCAAAGAT AGGAAGAGGC GCT CACAAGAT GGCT GTC CAGACG GCTGT GCGAGCC 
T CACAGC CAC GGCTC C CT CCCCAGAGGTTT CT G CAGCT GCCACCAT CT CCT TAAT GACAGAC GAG C CT GG CCTAGACA 
AC CCT GCCT ACGT GT CCT CGG CAGAGGACGGGCAGCCAGCAAT CAGCC CAGT G GACTCT GG CCGGAGCAACC GAACTA 
G GGCAC GGCC CTTT GAGAGAT CCACTATTAGAAGCAGATCAT TTAAAAAAATAAAT C GAGCTTT GAGT GT T CTTCGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA 
GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 
T GATC GCCAGAGACGG C CGGCT ACT GC CAGGAGACAT CAT T CTAAAG GT CAACGGGAT GGACAT CAGCAAT GT CCCTC 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAAT GGACAGGCC CCGGAT GCCT ACAGAC CCC GAGAT GACAGCTTT CAT GT GATTCT CAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGC CCT GACAT CTT T CAGGAAGCCGGCT GGAACAGCAAT GGCAGCT G GT CC CCAGGGC CAGGGGAGAGGAG CAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCATGACC GT CGCAGGGGGAGCAT CACATAGAGAAT GGGATTT G C CTAT CTAT GTCAT CAGTGT T GAGC CCGGAGGAG 
T CATAAGCAGAGAT GGAAGAATAAAAACAGGTGACATTTT GTT GAATGT GGAT GG GGT C GAACT GACAGAGGT CAGCC 
GGAGT GAGGCAGT GGCAT TATTGAAAAGAACAT CAT CCT CGATAGTACT C AAAGCT TT G GAAGTCAAAGAGTAT GAGC 
C CCAGGAAGACT GCAGCAGCCCAGCAGC C CT GGACT CCAAC CACAACATGG CC CCACC CAGTGACT GGT C C CCATC CT 
GGGT CAT GTGGCT GGAATT ACCAC GGT GCT T GTATAACT GTAAAGATATT GTATTACGAAGAAACACAGCT GGAAGTC 
T GGG C T TCT GCATT GTAGGAGGT TAT GAAGAATACAAT GGAAACAAACCTTTTTT CAT CAAAT CCATT GTT GAAGGAA 
CAC CAGCATACAAT GATGGAAGAATTAGAT GTGGT GATAT T CTT CTTGCT GTCAAT GGTAGAAGTACAT CAGGAAT GA 
TAC AT GCTT GCTT GGCAAGACTG CT GAAAGAACTTAAAGGAAGAATTACT CTAAC TATT GTTT CTT GGC CTGGCACTT 
TTTT ATAGA AT CAAT GATGGGTCAGAGGAAAACAGAAAAAT CACAAATAG GCTAAGAAGTT GAAACACTATAT T TATC 
T TGT CAGTT TT TATATTTAAAGAAAGAATACATT GT AAAAAT GT CAGGAAAAGTAT GAT CATCTAAT GAAAGCCAGTT 
ACACCTCAGAAAATAT GATT CCAAAAAAATTAAAACTACTAGT TTTTTTT CAGT GT GGAGGATTT CTCAT TACT CTAC 
AACATTGTT TATAT TT TTT CT AT T CAATAAAAAGCCCTAAAACAACTAAAATGATT GATTT GTATACC CCACT GAATT 
CAAGCT GATTTAAATT TAAAATT T GGTATAT GCT GAAGT CT GC CAAGGGTACAT TATGGC CATT TTTAATT TACAGCT 
AAAAT ATTTTTTAAAAT GCATTGCTGAGAAACGT T G CT TTCAT CAAACAAGAATAAATATT TTT CAGAAGTTAAA 
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FIGURE 4Q 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHI I IQHIYRDGVIARDGRLLPGDI I LKVNGMD I SNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEWEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features : 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE Ai 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
T TTGGAAGAGGT ACAGGCAGAAAAT CCCAAAGT GT CCAGAGGCCGGTAT CGCCCT CAGGAATGTAAAG 
f*k CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
O CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 

* sl T AAAAAGT TTAATCGAGCCAAACT CTTGAATGT GGGCTAT CTAGAAGCCCT CAAGGAAGAAAAT T GGG 

#1 

j* s ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
IP CATCCCAAGCATCTGGTGGT T GGCAGGAACAGCACTGGGTACAGGTTAC GT TACAGT GGATATT T TGG 

fl| GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
III GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
^ CCTGAAGTGGGTAAATATACAATGGTGTT CCACACTAGAGACAAAGGCAATGAGGT GAACGCAGAAC G 

7jf GATGAAGCTCT TACACCAAGTGT CACGAGTCT GGAGAACAGAT GGGTTGAGTAGTTGTTCT TAT AAAT 

J: TAGTATCTGTGGAACACAAT CCT T TATATATCAACAT CACAGT GGATTT CT GGTTTGGTGCATGACCC 

iss TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
fl ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
flf TAGCAGAGCTCCT GGTGAT GTAGAGTATAAAACAGT T GTAACAAGACAGCT TT CT TAGTCATT TTGAT 

CAT GAGGGT TAAATATTGTAATATGGATACTT GAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTATT TGAGGACTCT GGTTGAAGGAGAT T TATTTAAATTTGAAGTAATATAT T ATGGGAT 
AAAAGGCCACAGGAAATAAGACT GCTGAATGT CTGAGAGAACCAGAGTT GTT CTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTT ATTCAT TTATCCT GTACAAT CATCTGTGAAGT GGT GGT GTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGAC GAAT CAGGACACAGT GAACT TGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 



APP_ID= 



10063563 



Page 193 of 320 



FIGURE 42 

MGFNLT FHLS YKFRLLLLLTLCLTWGWATSNYFVGAI QE I PKAKEFMANFHKTLI LGKGKTLTN 
EAS TKKVELDNCPSVSPYLRGQSKL I FKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAI LVPH 
RNREKHmYLLEHLHPFLQRQQLDYGIWIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEWAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNPLYINI TVDFWFGA 

Important features : 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GC TCAAGACCCAGCAGT GGGACAGCCAGACAGACGGCACGATGGCAC T GAGC TCCCAGAT C T GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 



MMiSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 



cAMP- and cQ4P-dependent protein kinase phosphorylation site. 

amino acids 58-59 





N-myristoylation site. 




amino acids 44-50 



Prokaryotic membrane lipoprotein lipid attachment site. 




amino acids 1-12 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCT CT TGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGAC TCAGGGATCTAC TATG T GGGGATATACAGC T CAT CAC T CCAGCAGCCC T CCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
Q AT AAGAATGGCACC T GT GT GACCAATCT GACAT GC TGCATGGAACATGGGGAAGAGGATGT GAT T 

O TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
yl C TGGAGATGGGGAGAAAGT GATATGACCT TCAT CTGC GT T GCCAGGAACCCTGTCAGCAGAAAC T 

Tt TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
ifi GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
|y G T T TC TGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAG T GGACAT T TGTCGGG 

3 AAAC TCCTAACATAT GCCCCCAT TC TGGAGAGAACACAGAGTACGACACAATCCCTCACAC TAAT 

CI AGAACAAT CCTAAAGGAAGAT CCAGCAAATACGGTT T ACT CCAC TGTGGAAATACCGAAAAAGAT 

« GGAAAAT CCCCACTCACT GCTCACGAT GCCAGACACACCAAGGC TATT TGCC TAT GAGAAT GTTA 

PI TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

III 
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FIGURE a6 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPIl^KLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
C TGCAACAACAGAAC TGGAATGT T TCT T T CAT CAT T TTT CAGT GTGATCACAGT CAT TGG T GCT C 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT TG TGAAT TTT CATT GAAAAACAT CAGT GACAT TCAT CCAGAAT CC T TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
J^f ACACCAT GGCGAGTGGC TGGAGAGCATC TAGT TTCCACT T CGAT TC T GAAGAAAACAAACATAGG 

rf| CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
l|| CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
Iff TGTAGTTTAATGGGAATAAAATGTAAGTATCAGTAGTTTGAAAAAAAAAAA 

ff! 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 



cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 223-227 




N-glycosylation sites. 



amino acids 80-84, 132-136, 148-152, 163-167 



Si;;! 



N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 




Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 



TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE aq 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAG AAGCAGCACAGTC C TG TACC T GAGAAGGCC AT C CCAC TCAT CAC T CCAGGC T C T GCCA 

CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE *o 

MERVTIJVLLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
S QKQHS PVPEKAI PL I T PGS AT TC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGTGGGAGGAAGACAC TCT GGAGAGAGAGGGGGC TGGGCAGAGATGAAGT TC CAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AG C AGAT G C T T T GGG CAACAGGG TC G G GGAAG C AGC C CAT G C T C T GG GAAAC AC T GGG C AC GAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAAT GAAGGGTGCACGAATCCCCCACCATCTGGCTCAGG T GGAGGC T CCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGG71AATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 

MKFQGPIACLLL^CLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 



m 



Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE S3 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATOTCGCTGCTGAGCCTGCCCTGG 

CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 

ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 

CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 

TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 

CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 

TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 

AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 

GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 

CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 

CAGAAAT GCATC T TCAGCT TTGACAGCCAT TGTCAGGAGAGGCCCAG T GAATATATTGCCACCAT 

CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 

ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 

GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 

AGCCAAGTCCAAGAC TT TGGATT TCAT TGATGT GC T T CT GC TGAGCAAGGATGAAGAT GGGAAGG 

CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 

GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 

ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 

TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 

CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 

GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 

GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 

GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 

CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 



MSLLSLPWLGLRPVAMSPWLLLLLWGSWLIARIIA^ 

L I TPTEEGLKDS TQMSATYSQGFTVWLGPI I PFI VLCHPDT IRS I TNASAAI APKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFI SRCCTQDI VLPDGRVI PKG I TCL I D 1 1 GVHHNPTVWPDPE VYDP FRFDPENSKGRS P 
LAFI PFSAGPRNC IGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 



Important features: 



Transmembrane domains: 



amino acids 13-32 (type II), 77-102 




N-glycosylation sites. 

amino acids 112-116, 168-172 



Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 
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FIGURE 5 5 

ATCGCAT CAAT TGGGAGTACCAT CT TCCT C ATG GGACCAGT GAAACAGCT GAAGCGAAT GT T T 6A 
GCCTACTCGTTTGATTGC7VACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCAT GGCCAGT T T TATGAAGCT TT GGAAGGCACTATGGACAGAAGCT GGT GGACAGT TT T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
G T GAT T CGAACAT TTGAGGGT TACT T T TGGAAGCAACAATACAT TC T CGAACCTGAAT GTCAG TA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAA?U\AAAAA 
AAAAAAAAAAAAAAAAAAA?y\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^^ 
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FIGURE *6 



MGPVKQLKRMFEPTRLIAT IMVLLCFALTLCSAFWWHNKGLALI FC ILQSLALTWYSLS FI PFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

H 

Q Type II fibronectin collagen -binding domain protein. 

O amino acids 30-72 

fi 
w 
111 
if! 
lil 

o 

If! 
O 

ru 
o 
ru 



APP_ID=1 0063563 Page 208 



FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTT T CAAGCT CAGTGTCT TCAT CCCCT C CC AGGAAT T CT C CACCTACC GC CAGT GGAAGCAGAAAAT T GTACAAGCT 
GGAGAT AAGGAC CTTGAT GGGCAGCTAGACT T T GAAGAATTT GT CCATTATCT C CAAGATCAT GAGAAGAAGCT GAGG 
C T GGT GTT TAAGATTTT GGACAAAAAGAATGAT GGACG CATT GACGCGCAGGAGAT CAT GCAGT CCCT GCGGGACT T G 
GGAGTCAAGATAT CTGAACAGCAGGCAGAAAAAATT CT CAAGAGCATGGATAAAAACGGCAC GAT GACCAT C GACT GG 
AAC GAGT GGAGAGACT AC CACCT CCT C CACC CC GTGGAAAACAT CC CCGAGATCAT CCT CTACT GGAAGCATT CCAC G 
AT CT TT GAT GT GGGTGAGAAT CTAACGGTCC C GGAT GAGTT CACAGT GGAGGAGAGGCAGAC GGGGAT GTGGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAG GTCCAT GC CT CCCGCAGCAACAACATGGGCATC GTT GGT GGCTT CACTCAGATGAT TCGAGAAGGAGGG GC CAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
l Jn ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 

ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
C!j AT GCTGGACTGC GCCAGGAGGAT C CT GGCCAGAGAGGG GGT GGCCGCCT TCT ACAAAGGCTAT GTCC CCAACAT GCT G 

f*% GGCATCAT CCC CTATGCC GGCAT C GAC CTTGCAGTCTAC GAGACGCT CAAGAAT GCCT GGCT GCAGCACTAT GCAGT G 

AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
Iff CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
| . | TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
Yz, GCTGTGAGCAT CAGCTACGTGGT CTAC GAGAACCTGAAGAT CACCCT GGGCGTGCAGT CGCG GTGA C GGGGG GAGGGC 

If I CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
m AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
?' I GT C CTGCT GACC CCAGCAGACCCT CCT GTTG GT T CCAGC GAAGACCACAGGCATTCCTTAGG GT CCAGGGT CAGCAGG 

HI CTCC GGGCT CACAT GT GTAAGGAC AGGACATTT T CT GCAGT G CCTGCCAATAGT GAGCT TGGAGCCT GGAGGC C GGCT 

s TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
Csl ATAATCCAT GAT GAAAGGT GAGGT CACGTGG C CT CC CAG GCCTGACT T CCCAAC CTACAGCAT TGAC GC CAACT TGG C 

Iff T GT GAAGGAAGAGGAAAGGAT CT GGCCTT GT G GT CACT G GCAT CTGAGCCCT GCTGAT GGCT GGGGCT CTCGGGCAT G 

%l CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
O GGC CTGGACCCT GT CAGGATGGGC CCCAC CT CAGAACCAAACT CACT GT CC CCACT GT GGCAT GAGGGCAGT GGAGCA 

fH CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
t.;! ACT GTT GG GAAAAGGGT TTTGT CCAGAAGGACAAGC CGGACAAATGAGCGACTT CT GT GCTT CCAGAGGAAGACGAGG 

|J GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
fjl GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 

AACTAT TTTTATAGAT T T GTT TAATTAATAGCTTGT CATTT T CAAGT T CATTT TTTATTCATATTTAT GTTCAT GGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GT CCAGAGAAATT CCTT TT GGGACTGGAGGCAGAAAAGC GGCCAGAAGGCAGCAGCCCT GGCT C CTTT CCTTT GGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAAT CT T CAATAGGATGCAAAGAT CAAT GCAAAAATTGTTATATAT GAACATATAACT G GAGT C GT CAAAAAG 
CAAATTAAGAAAGAATT GGACGTTAGAAGTT GT CATTTAAAGCAGC CTT CTAATAAAGTTGTTTCAAAG CT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCLYVPVI GEAQTE FQ YFE S KGL PAELKS I FKLS VFI PSQE FS T YRQWKQK I VQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFK I LDKKNDGR I DAQE IMQS LRDLGVKI SEQQAEK I LKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHS T I FDVGENLT VPDE FT VEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LARE GVAAFYKG YVPNMLG 1 1 PYAG I DLAVYE TLKNAWLQH YAVNSADPGVFVLLACGTMS S TC 
GQIASYPLALTOTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGIAPNFMKVIPAVSISYV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE zq 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT T GGGGAGGAT GGAATCCT GAGCTGCACT T TT GAACC T GACAT CAAAC T TT CT GATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACT T CTAAAGGCAAGGGGAAT GCTAACC T TGAG TATAAAAC TGGAGC C TTCAGCATGCCGG 

r% 

IZ AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
m CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
IJ CAGCTTT GAGC TGAAC TCT GAGAAT GTGACCATGAAGGT T GTGT C TG TGC T C TACAATGT TACGA 

Ifl TCAACAACACATACTCCTGTATGAT TGAAAATGACAT TGCCAAAGCAACAGGGGATATCAAAGTG 

f I ACAGAAT CGGAGATCAAAAGGCGGAG T CACCTACAGC T GC TAAAC T CAAAGGC T TCTCTGTGTGT 

lil 

I"" CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 

M GTGC CT T GGCCACAAAAAAGCATGCAAAG T CAT TGT TACAACAGGGAT CTACAGAAC TAT TT CAC 

III CACCAGATATGACCTAGT TTTATAT T TCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 

0 AGCAAAC AAGAGCAAGAAAC AAAAAGAAGC C AAAAGCAGAAGG C T C CAAT AT GAACAAGAT AAAT 

1 C TAT CT TCAAAGACATAT TAGAAGT TGGGAAAATAAT TCAT GTGAAC T AGACAAGTGT GT TAAGA 

o 

?g g GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGAT TT TATAAATAAACTGAGCACCT TCT T T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVOTDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 
NTSFELNSEWTMKWSVLYWTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

amino acids 258-281 
N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACC ATG GCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT TGGTCTTCATTCAAT GATGAGACC TGCCTCATGATGATAAACATGT TTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGC CT TCCGGGAGAAGGACACAGC TGTACAAGGCAACATCCGGC TCAGCT TCGAGGAC T T CGTCA 
CCATGACAGC T TC TCGGAT GCTATGACCCAACCATCT GT GGAGAGTGGAGT GCACCAGGGACCT T 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGA&GTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C TGACCAAGCATGAGAGAGATC TGTC TATGGGACCAGT GGC T TGGAT TC TGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T TGGGGCCAAAAG T CCAGTGAAAT T GTAAGC T TCAATAAAAGGAT GAAACTC TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCI^INMFDKTKSGRIDVYGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites, 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

ass 

r% TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
O TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
W CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

?Zf TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

111 

m ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 

I|j CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

3 GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

Jff CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

Jjf GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

fia TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

fj TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

flj AAAAAAAAAAAAAAAAAAAAMAAAAAAAAA^ 
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FIGURE 6a 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoYlation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATGG GGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACAC ATGAC AGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HEARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 



N-myristoylation sites. 



r| N-glycosylation site. 

O amino acids 37-41 

CP 

w 
If I 

HI amino acids 15-21, 19-25, 60-66 

W 

o 

If! 
O 
III 

a 

ry 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
Q GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
ff! GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
W AGAC T CTGCCCC TGAGGTCAAGAGAGGAT GGGGC TAT TCAC TTT T ATATAT TTATATAAAAT TAG 

TAGT GAGATGTAAAAAAAAAAAAAAAAAA 

■TZ'r, 
til 
S 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site, 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACTAGAGAG GAACA ATG GGGT TATT CAGAGGTTTT GTT TTCCT CTTAGTT CTGT GCCT GCT G CACCAG 
T CAAATACTT CCTT CATT AAGCT GAATAAT AAT GGCTT T GAAGAT ATT GTCAT T GT TATAGAT CCTAGT GT GCCAGAA 
GAT GAAAAAAT AAT TGAACAAATAGAG GATATGGTGACTACAGCT T CTACGTACCT GTT TGAAGCCACAGAAAAAAGA 
TTT TT T TT CAAAAATGTAT CTATATTAATT CCTGAGAATT GGAAGGAAAAT CCTCAGTACAAAAGGCCAAAACATGAA 
AACCATAAACAT GCT GAT GTTATAGTTGCACCACCTACACTCCCAGGTAGAGAT GAAC CATACAC CAAGCAGTTCACA 
GAAT GT GGAGAGAAAGGCGAATACATTCACTTCAC C CCT GACCT T CTACT T GGAAAAAAACAAAAT GAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACC GT GCT AAGT CAAAAAAAAT CGAAGCAACAAGGT GTT C CGCAGGTATCT CT GGTAGAAATAGAGT TTATAAGT GT 
CAAGGAGGCAGCT GT CTTAGTAGAGCAT GCAGAATT GATTCTACAACAAAACT GTAT GGAAAAGAT TGT CAATT CT TT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATGCAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAA 
AAAACC CATAAT CAAGAAGCT CCAAGCCTACAAAACAT AAAGT GCAATTTT AGAAGTACAT GGGAG GT GATTAGCAAT 
1U: T CT GAGGAT TT T AAAAACACCATACCCAT GGT GACACCACCT CCTCCAC CT GT CTT CT CAT T GCTGAAGAT CAGT CAA 

£«g AGAAT T GT GT GCTTAGTT CT T GATAAGT CT GGAAGCAT GGG GGGTAAGGACCGCCTAAATCGAATGAAT CAAGCAGCA 

AAACATT T CCTGCT GCAGACT GTT GAAAATGGAT CCT GGGT GGGGAT G GTT CACTTT GATAGTACT GCCACTATT GTA 
O AATAAG CTAATCCAAAT AAAAAG CAGT GAT GAAAGAAACACACT CAT G GCAGGATTACCT ACATAT CCT CT GG GAGGA 

fl% ACT T CCAT CT GCT CTG GAATTAAATAT GCAT TT CAG GT GATT GGAGAGCTACATTCCCAACTC GAT GGATCCGAAGTA 

f I CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 

||j TTTAT T GCT TTGGGAAGAGCT GCT GAT GAAGCAGTAAT AGAGAT GAGCAAGAT AACAGGAGGAAGT CATTTTTATGTT 

Iff T CAGAT GAAGCT CAGAACAAT GGCCT CAT TGAT GCTT T T GGGGCT CTTACAT CAGGAAATACT GAT CT CT CC C AGAAG 

J; T CCCTTCAGCT C GAAAGTAAGG GATTAACACT GAATAGTAAT GC CT GGAT GAAC GACACTGT CATAAT T GATAGTACA 

CP GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
| 5 a AT AAT GGAAAAT TT CACAGTGGAT GCAACTT CCAAAAT GGCCTAT CT CAGTAT T CCAGGAACT GCAAAGGTGGGCACT 

T GGG CATACAAT CTTCAAGCCAAAGC GAACCCAGAAACATTAACTAT TACAGTAACTTCT CGAGCAGCAAATT CTT CT 
3 GT GC CTCCAAT CACAGT GAAT GCTAAAAT GAAT AAGGAC GTAAACAGTTTCCC CAGC C CAAT GATT GT TTACGCAGAA 

ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
T T GGAACT TTT GGATAAT GGT GCAGGCGCTGATT CTT T CAAGAATGATGGAGT CTACT CCAGGTAT T TTACAGCATAT 
|| I ACAGAAAAT GGCAGATATAGCT TAAAAGT TCGGGCT CAT GGAGGAGCAAACAC T GC CAGGCT AAAATTACGGCCTCCA 

CTGAATAGAGCC GC GTACATACCAGGCT G GGTAGTGAACGGG GAAATT GAAG CAAACC CGCCAAGACCT GAAATTGAT 
jlf;! GAG GATACT CAGAC CAC CTTGGAG GATTT CAGCC GAACAGCATCCGGAGGT GCATTTGT GGTAT CACAAGT CCCAAGC 

ClJ CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
f«| ACAT GGACAGCAC CAG GAGATAATTTT GAT GTT GGAAAAGTT CAACGTTATATCATAAGAAT AAGT GCAAGTAT TCT T 

Jf; GAT CTAAGAGACAGTTT T GAT GAT GCTCT T CAAGTAAATACTACTGAT CTGT CAC CAAAGGAGGCCAACT CCAAGGAA 

I'll AGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCCAC^TATTTATTGCCATTAAAAGTATAGATAAA 

AGCAATTTGACAT CAAAAGTAT C CAACAT TGCACAAGTAACT TTGTTTATCCCT CAAGCAAAT CCT GAT GACAT T GAT 
CCTACACCTACTCCTACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTATTG 
TCT GTGATT GGGT CT GTT GTAATT GTTAACT TTATTT TAAGT AC CAC CATT TGAA CCTTAAC GAAGAAAAAAAT CTT C 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATTTCTGAATCTTAAAATTCATCCCAT 
GT GT GAT CATAAACT CAT AAAAATAATT TTAAGATGT CGGAAAAGGATACT TTGATTAAAT AAAAACACT CAT GGAT A 
T GTAAAAACTGT CAAGATTAAAATTTAATAGT TT CAT TTAT TTGTTAT TTTATTT GT AAGAAATAGT GAT GAACAAAG 
AT CCTTTTTCATACT GATACCT GGTT GTATAT TATTTGATGCAACAGT TTT CTGAAAT GATATT T GAAATT GCAT CAA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGG 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTmAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGL I DAFGALT SGNT DLS QKSLQLE SKGLTLNSNAWMNDTVI IDS TVGKDT FFL 
I TWNSLPPS I S LWDPSGTIMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANS SVPP I T VNAKMNKDVNS FPS PMIVYAE ILQGYVPVLGANVTAFI E S QNGHTEVLELLDNGA 
GADSFKNDGWSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



FIGURE 71 



CT CCTT AG GT GGAAAC C CTGGGAGTAGAGTACT GACAGCAAAGAC CGGGAAAGACCATAC GTCCCC GGG CAGGGGT GA 
CAACAGGT GT CAT CTT TT TGAT CT CGTGT GT GGCT GCCTTC CTATT TCAAGGAAAGAC GCCAAGGTAATT TTGACC CA 
GAGGAGCAAT GAT GTAGCCAC CT CCTAACCT T CC CTT CTT GAACC C CCAGT TAT GC CAGGATT TACTAGAGAGTGT CA 
ACT CAACCAGCAAGC GGCT CCTT CGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGT GGG GCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACT TG CCT GAGGGTGGACCAGAAGAAAGGAAAG GTCCCCTCTTGCTGTT GGCT GCACATCAGGAA 
GGCT GT GATGGGAAT GAAGGTGAAAACTT GGAGATTT CACTTC AGT CATT GCT T CT GCCT GCAAGAT CAT CCTT TAAA 
AGTAGAGAAGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTT TT GCT GGTGCT CCTCT G C T GT GCTAT CT CT GT C CT GT ACAT GT T G GCCTGCAC CC CAAAAGGTGACGAGGAGCAG 
CTGGCACT GCCCAGGGCCAACAG CCCCACGGGGAAGGAGG GGTACCAG GCCGT C CT TCAGGAGT GGGAGGAGC AG CAC 
CGCAACTACGT GAGCAGCCT GAAGC GGCAGATCGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 
f = * AAT GG GCAGTACCAAG CCAGC GAT GCTGCT GGCCT GGGT CT GGACAGGAGCCCC CCAGAGAAAAC CCAG GCCGAC CT C 

jp CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
Hi C CTTT C GATAGCT TT ACT CTACAGAAGGT GTACCAGCT GGAGACT GGCCT TACC CGCCAC C CCGAGGAGAAGCCT GTG 

W AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
|f| CCCAAT CACCGTCCTTACACGGCCTCTGATT T CATAGAAGGGAT CT ACCGAACAGAAAGGGACAAAGG GACATT GTAT 

GAGCT CAC CTT CAAAGGG GACCACAAACAC GAATT CAAACGGCT CATCTTATT T CGACCAT TCAGCCC CAT CAT GAAA 
^ GT GAAAAAT GAAAAGCTCAACAT GGCCAACACGCTTAT C AATGTT ATCGT GCCT CTAGCAAAAAGGGT GGACAAGTTC 

HI CGGCAGTT CAT GCAGAATTT CAGGGAGAT GT GCATT GAGCAGGATGGGAGAGT CCATCT CACTGTT GT TTACTT T GGG 

ffi AAAGAAGAAATAAATGAAGT CAAAGGAATACTT GAAAACACTT CCAAAGCTGC CAACTT CAGGAACTT TACCTT CATC 

CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
l|| TTTTTCTGTGATGTGGAC^TCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 

GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
r ^ CAGCAGCT G GT CATAAAGAAGGAAACTGGATTT T GGAGAGACT TT GGAT TT GGGAT GAC GT GT CAGTATCGGTCAGAC 

Q TTCAT CAATATAGGTGGGTTT GAT CT GGACAT CAAAGGCT GGGG CGGAGAGGAT GT GCACCTT TAT C GCAAGTAT CTC 

5 «B CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
^"l CTGAC CCC CGAGCAGTACAAGAT GTGCAT GCAGT C CAAGGC CAT GAACGAG GCAT C CCACGGCCAGCT GGGCAT GCTG 

O GTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAAC ATGAA CTCCCAGA 
sTg GAAG GATT GTGGGAGACACT TT TT CTTT CCT TTT GCAATTACT GAAAGTGG CTGCAACAGAGAAAAGACTT CCATAAA 

*M GGAC GACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGC CTCC GATT T CT CT CT GTT GGGCTTTTTACAACAGA 

Q AAT CAAAATCT CC GCT TT GCCTGCAAAAGTAACC CAGTT GCAC CCTGT GAAGT GTCT GACAAAGGCAGAAT GCTTGT G 

i$\ AGATTATAAGC CTAATGGT GT GGAGGTTTTGATGGT GTT TACAATACACTGAGACCT GTTGTTTTGT GT GCT CATT GA 

AATATT CAT GATTTAAGAGCAGTTTT GTAAAAAATT CAT TAGCATGAAAGGCAAGCATATT T CTCCTCATAT GAAT GA 
GCCTAT CAGCAGGGCT CTAGTTT CTAGGAAT G CTAAAAT AT CAGAAGGCAGGAGAGGAGATAGGCT TATTAT GATACT 
AGT GAGTACAT TAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAAT ATC GT GT CATATTT T CC C CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTT TT CC CTT GT GAGTTATAGT CTGCTTATTTAATTACCACTTTGCAAGCCT TACAAGAGAGCACAAGTT GGCCT AC 
ATT TTTATATT TTTTAAGAAGATACTTT GAGAT GCAT TATGAGAACTTT CAGT T CAAAGCAT CAAAT T GAT GC CAT AT 
CCAAGGACAT GCCAAAT GCT GATT CT GT CAGGCACT GAATGT CAGGCATTGAGACAT AGGGAAGGAAT GGT T TGTACT 
AATACAGACGTACAGATACTTT CT CTGAAGAGTATTT TCGAAGAGGAG CAACT GAACACTGGAGGAAAAGAAAAT GAC 
ACT TTCTGCTTTACAGAAAAGGAAACT CATTCAGACT GGTGATATCGT GAT GTACCTAAAAGT CAGAAACCACATTT T 
CTCCT CAGAAGT AGGGACCGCT TT CTTACCT GTTT AAATAAACCAAAGT ATACC GT GT GAAC CAAACAATCT CTTTT C 
AAAACAGGGTGCT CCTCCT GGCTT CT G GCTT CCAT AAGAAGAAAT GGAGAAAAATATATATATATATATATATATTGT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACTGAAT TATT T TTTAAATTAAGCAGT TCTACTCAAT CACCAAGAT GCTT CT GAAAATT GCATT TTATTAC CATT T 
CAAACTATT TTTTAAAAATAAATACAGTTAACATAGAGT GGTTT CT T CATT CAT GT GAAAAT TATTAGCCAGCACCAG 
AT GCAT GAGCTAATTAT CT CTT T GAGT C CTT GCTTCT GTTT GCT CACAGTAAACTCATT GT T TAAAAG CTT CAAGAAC 
ATT CAAGCTGTT GGTGT GTTAAAAAAT GCAT T GTATT GATTT GTACT GGTAGT TTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGTGGTATT G CACAGCTAAT AAAATAT GATTT GT GGATAT GAA 
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FIGURE 72 

MMMVRRGLIAWI SRVWLLVLLCCAI SVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 

AEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 

ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHE FKRLILFRPFSPIMKVKNEKLNMAN 

TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIVVRTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEM 

RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGAC TGCAGAGGGAGATAAAGAGAGAGG GCAAAGAGGCAGCAAGAGAT T T GTCC TGGGGAT CCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
s , GGAAACAATGCCCAGTCGCCCAT CGATAT T CAGACAGACAGTGT GACAT T T GACCCTGAT T T GCC 

Q TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
O ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
CH GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
ft TGAAGCCACAT T TGCAGAGCTCCACAT T GTACAT TATGACT CT GAT T CCTATGACAGC T TGAGTG 

f& AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
! % I AAT ATAGCT TAT GAACACAT TCT GAGT CACT TGCATGAAGTCAGGCATAAAGAT CAGAAGACC TC 

M AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
%'J GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
^ AT T T CAAT GGAACAGC TGGAAAAGCT T CAGGGGACAT T GT TCTCCACAGAAGAGGAGCCCTC TAA 

GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
C C GAAAGAG TGTGGTCTT C AC C T C AG C AC AAGC C AC GAC T GAG G C ATAAA T T C C T T C T C AGAT AC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACAC TGTAGGAGTAGTAAGCAGAT GTCC TCC T TCCCCT GGACAT C TCTTAGAGAGGAAT 
GGACCCAGGCTGTCAT TCCAGGAAGAACTGCAGAGCC T TCAGCCTCTCCAAACAT G TAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C T CCT TAGGATAAAGAGT TGCT GT TGAAGT T GTATAT T T T T GAT CAAT AT AT T TGGAAAT TAAAG 
TTTCTGACTTT 



m 
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FIGURE 7 A 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFAS FI QAGS S YT TGEMLSLGVGI LVGCLCLLLAVYFI ARK IRKKRLENRKS 
WFT SAQATTEA 




Important features of the protein: 
Signal peptide: 



amino acids 1-15 



Transmembrane domain: 



w 



amino acids 291-310 



N-glycosylation site. 



amino acids 213-216 



Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7S 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
T CTGGTGGTTTGCCTAAACCTGCAAACATCACC TTCT TATCCAT CAACAT GAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGT CTT CAAGGAGT TAAAGT TAC T TACAC T G T GCAGTAT T TCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCC TGTT TCCATGCAACAAATATAC TCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
T C T GT GAT GGGC TAT T CCAT CTACCGAT AT ATCCACG T TGGCAAAGAGAAACACCCAGCAAAT T T 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
AC TT TATCACCC TCAATATC TCGGAT GAT T CTAAAAT TTCT CAT C AGG AT AT GAG T T TACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACAT TTAGGGTATGCT TCGCATT TGATGGAAATT TT TTGTGACTCTGAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAG T CAT T GAATATGAATATGATGT CAGAACCACT GACATTTGT GCGGGGC CTGAAGAGCAGGA 
GC T CAG T T T GCAGGAGGAGGTGT CCACACAAGGAACAT TAT TGGAGT CGCAGGCAGCG T TGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCC T T CT GAGGGGGATGGGCT CGGAGAGGAGGGT CT T CTATCT AGACT CTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYWSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGC TGAAGGACCACAACGCCACCAGCATCCT GCAGCAGCTGC CGCT GC TCAGT GCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCT GGCT GAAGGT CATCACAGCTAACAT CCT C CAGC TGCAGGT GAAGCCCTCGGCCAAT GACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
fll TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
S CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
ff! TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
W CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 

CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
!3 TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
il ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 

is;:. 

;^ T GAT CAAT GAAAAGGCTGCAGATAAGCTGGGAT CTACCCAGATCGTGAAGAT CCTAAC TCAGGAC 

4 ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
; f fl TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T TACAC CAAAGGTGACCAACT T ATACTCAACT TGAATAACAT CAGCT CTGATCGGATCCAGC TG 
ATGAACT CT GGGATT GGC T GGT T CCAACCT GAT GT TC TGAAAAACAT CAT CACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 



at 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNTVLKHI I WLKVI TAN I LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KWFmSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DR I QLMNS GI GW FQPDVLKNI I TE I IHS I LLPNQNGKLRSGVPVS LVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGAT TAGAGGTGTT CAAGGAGCAAGAGC 
TT CAGCCT GAAGACAAGGGAGCAGT CCCT GAAGACGC TTC TAC TGAGAGGT CT GCCATGGCC TCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CC AAGGGCCT CT GGATGGAAT GT GCCACACACAGCACAGGCAT CACCCAGT GTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
*^ GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
Jjf ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
.m CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
|y CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
III CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

P GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

111 

GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
r| GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
HI GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
O TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

- ^ GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

Hi 

lis TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 

GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT TT GTGCAT GAACTGAAATAAAACCATCCTACGGTAT CCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCC T GGGACAT T TAAAAAAATA 
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FIGURE 80 



MAS LGLQLVG Y I LGLLGLLGTLVAMLL PS WKT S S YVGAS I VTAVG FSKGLWMECATHS TG ITQCD 
I YSTLLGLPADI QAAQAMMVTS SAI S S LAC 1 1 S WGMRCTVFCQE SRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNSYS LTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
P TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
W AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

Zl ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

Cll 

i B I ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

ig GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

O AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 



CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
C T CGT C CACACATCGGTATCCCCAAGCCCAGACAACC T GCGTCGC TT T GCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCT TCCACTT GACT CAGATCAAGAAAGT CAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTC T TCC TAGCCCAGACCCTGGAGCTGAAGGT CACGGCCAGT CCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAA CAG T T GCAGAT AT GAGC TGTATAATT GT T GT T AT TAT AT AT TAATAAA 
TAAGAAGT TGCAT TACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGIALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TC CGT T TCCATGCCGT GAGGTCCATT CAC AGAACAC AT CC ATGGCTCT CAT GCT CAGT T T GGTT CT GAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGGAC AAAACT GGT GAAGG AT TCT ATT GC GGAGGGGCGC ATCT CT CTGAGGC TGGAAAACATT ACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGT GTC AGC ACTGGGC TCAGT TCCT CTCATT T C CAT C ACGGGAT ATGT TGATAGAGAC AT CCAGC T AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AGACTCC AGGACAAAC AGAG ACAT GC AT GG CCT GTT TG AT GTGGAGAT CTCT CT GACCGT CC AAGAGAA 
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAG AAAGCACGGACAGGCAGAATTGAGAGACGC CCGGAAAC ACGC AGTG GAGGTG AC TCT GGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTC ACTCT GAGAAGAGAT TTACAAGGAAGAGTGTGGTGGCT T C TCAGAGT T T C CAAGC AGGGAAAC ATT A 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT ACGT GACTTT GTCTCC CGATC ATGGGTACT GGGT CCTC AGACTGAATGGAGAAC AT T T GT AT T T C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TATT GAGGCC CTAC AT TG AGTATC CGTC CT AT AATG AGC AAAAT GGAACT CCC ATAGTC AT C TGC 
CC AGT C AC CC AGGAAT CAGAGAAAGAGGCCTCTT GGCAAAGGGCCTCT GC AAT CCC AG AG AC AAGC AAC AG 
TG AGTCCTCC TC AC AGGC AACCACGCCC TTCCTC CC CAGGGGT GAAAT GTAGGATGAATC ACAT CCC ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GACATT AC ATTT AGTT T GCTCT CACT CC ATCT GGCT AAGT GAT C TT GAAAT AC C ACCT CTCAGGTGAAG 
AACCGT CAGGAATT CC C ATCT CAC AGGCTGT GGT GTAGAT TAAGTAGACAAGGAATGT GAAT AATGC T TAG 
AT CT TATT GAT GAC AGAGT GT AT C CT AAT GGTT T GT TC AT TAT ATT AC ACTT TCAGT AAAAAAA 



10063563 



Page 235 



FIGURE 84 

MALMLS LVLS LLKLGSGQWQVFGPDKPVQALVGEDAAFS C FLS PKTNAE AMEVR FFRGQFS S WH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPL I S I TGYVDRD IQLLCQS SGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGL FDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 
amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8z 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCT C TGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAAC T GCTGAC GATGCAGAGT TCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCC T GAGCATCAGAGATGCCAGAAGAAGT GATGGGGGGAGA 
TACT TCT T TCGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACAT CAC CGGC TC TC TG TGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTC T CCC T GCAGAGCAAAGCCACAT CAGGAGTGAC TCAGGGGG T GGT CGGGGGAGCT GGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCC TT GGGACTCGCGGGGACAGGAGGCCAC T GACAC CGAGTAC TCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGAC T CACCCT GAT T GAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGAT TC T TGTAGAAT TAACAGCCCT CAACGT GATGAGCTATGATAACAC TATGAAT TATG 
T GCAGAGTGAAAAGCACACAGGCTT TAGAGTCAAAG TATC T CAAACC T GAATCCACACTGTGCCC 
T CCC T TT TATT T T T T TAACTAAAAGACAGACAAATT CC TA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
jySTTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
S VLT L I PQPQDHGT S L TCQVT FPGAS VT TNKT VHLNVS YP PQNLTMT VFQGDGT VS T VLGNGS S L 
S L PE GQSLRLVCAVDAVDSNPPARLS LSWRGLTLCP S QPSNPGVLELPWVHLRDAAE FTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 




Signal peptide: 



amino acids 1-15 



Transmembrane domain: 



amino acids 351-370 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGC T GAGACT CAGACAAGAT TACA ATGA ACCAACT CAGCT TCCT GCTGT TTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACT TACT TCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
p CAACTACAACACCTT TGGATCTGCAGAGGCGGCCACGAGCGAT GACTACAAGAACCCTGGCTACT 

Cl ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 

y l AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 

pj 

3 "^J GTT T GGCAT CTACCAGAAATAT CCAGTGAAATAT GGAGAAGGAAAGTG TT GGACTGACAACGGCC 

~? ii 

m CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 

|y GGCCAGCGGGAATTCACT GCGGGAT T T GT T CAGT TCAGGGTAT T TAATAACGAGAGAGCAGCCAA 

* CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

h! GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

p% GGAACTCAT GT TGGT TACAGCAGCAGCCGTGAGATAAC TGAGGCAGC T GT GCTTC TAT T CTATCG 

?|| T TGAGAGT T T T GT GGGAGGGAAC CC AGAC CTCTCCTCC CAAC CAT GAGAT C C CAAGGAT GGAGAA 

Q CAACT TACCCAG TAGC TAGAAT GT TAATGGCAGAAGAGAAAACAATAAAT CATATT GACTCAAGA 

CI.J AAAAAA 
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FIGURE 88 



MNQLS FLL FL I AT TRGWS TDEANT Y FKE WTCS S S PS L PRS CKE I KDECPS AFDGL YFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
S DDYKNPGYYD IQAKDLG I WHVPNKS PMQHWRNS S LLRYRTDTG FLQTLGHNL FGI YQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHC I GGGGYFPEAS PQQCGDFSGFDWS GYGTHVGYS S SRE I TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
!"! TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
Q TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
|f| GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
Ul. TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
W T TAAGACT TATATACAGT T T TAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASV7iLSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

Q 
O 

m 
w 

II! 
CP 

iy 

CI 

III 

o 
ru 
o 

ru 
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FIGURE qi 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
- : GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
Q AG T T CCACCAGGAACAGCAGAT CT TCATC GAAGCCAAGGCC TC CAAAAT C T TCAAC T GCCGGAT G 

©i GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 

CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
%l TCTACATCGCC T TC TACAGCACGGAC TAT CGGC TGGTCCAGAAGG TGT GCCCAGAT TACAAC TAC 

| § | CAT AG TGATACCCCC TACTACCCATC T GGG TGA. CCCGGGGCAGGCCACAGAGGCGAGGCCAGGGC 

n TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
13 AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
^ ? AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

Cf 

w ' AGGAGGAGT GGGCT CTCT GT GCAGCCTCACAGGGCT T T GCCACGGAGCCACAGAGAGAT GC TGGG 

ill 

Q TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
ffj CCTT GGTT C TTGCCATCC TGAGGAAAGATAGCAACAGGGAGGGGGAGAT T T CATCAGTGTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCT TAAGCTAAGACAGGACGATTGTGGTCC TCCCACAC TAAGGCCACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCC FVFLVQGS L YLVI CGQDDGPPGSEDPERDDHEGQPRPRVPRKRGH ISPKS RPMANST L 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKA.VEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGT TCCGATT TGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGT TTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCT GAAAC T C T GCCTGC TC TGCCAAGACAAGAAC TTTCTTCTT TACAACCAGCGC TCCAGAT 
AAC C T C AGG GAAC C AGC AC T TC C CAAACC G C AGAC T AC AT C T T T AGAG GAAG CACAAC T G T GC C T 
T T TT C T GAAAATCCC T T TT T CT GGTGGAAT T GAGAAAGAAATAAAAC TAT GCAGAT A 
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FIGURE Q4 

MTAAVFFGCAFIAFGPALALYVFT IAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FS FVNT L S DS LGPG TVG I HGDS PQFFLYS AFMTLVI I LLHVFWG I VFFDGCEKKKWG I LL I VLLT 
HLLVSAQT FI S S YYGINLASAFI I LVLMGT WAFLAAGGSCRS LKLCLLCQDKNFLL YNQRS R 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium:neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 



9» 



in 



AAT T T T T CACCAGAGTAAAC T T GAGAAACCAAC T GGACC T TGAG TAT T GT ACAT TTTGCCTCGTG 
GACCCAAAGGTAGCAATC TGAAACATGAGGAGTACGAT TC TACT GTTTTGTCTTC TAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGAT GC T CACAC T GGGGC CAGATC T GCATCTGT TAAATCCTGCT GCAGGAAT GACACC TGGTAC 
CCAGACCCACCCAT T GACCCT GGGAGGGT T GAATGTACAACAGCAAC TGCACCCACATGTGTTAC 
CAATT TTTGTCACACAACT TGGAGCCCAGGGCAC TATCCTAAGCTCAGAGGAAT TGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
T AAT CCAGATGTCCAGGAT GGAAGCC TT CCAGCAGGAGGAGCAGGT GTAAAT CC TGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAA GCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TATATTAT GGAATAGAT T GAGACACAT TGGATAGTCT T AGAAGAAAT TAAT T C T TAAT T 
TACCT GAAAATATTC T TGAAAT T TCAGAAAATAT GT T CTATGTAGAGAATCCCAACT TT TAAAAA 
CAAT AAT T C AAT GGAT AAAT CT G T C T T T GAAATAT AACAT TAT G C T G C C T GGAT GAT AT GCAT AT 
TAAAACATATT TGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



W AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 



M 
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MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
L PQ IFTSLIIHS L FPGG I L P TS QAGANPDVQDG S LPAGGAGWPATQG T PAGRL P T P S G 
T DDD FAVT T PAG I QRS THAI EE AT TE SANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCC T GT GCT T GTGCACGTGAAGGAT GAGAAT GACCAGGTGCC CCAT T TCT CTCAAGCCAT CTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
T GCAGAGGGAAACC T C TACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCT GAGTACC TGC TCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGT CCACCAGGTAC T GAAGTGACTAGAC TGTCAGCAGAGGAT GCAGATGC CCCCGGCT CCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
T CGAAG TCGCAGTCACAGATATCAAT GAT CACGCCCC TGAGTTCATCACTT CCCAGAT T GGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGC CCGCC TTCCGCC TCAT GGAT T T T GCCAT TGAGAGGGGAGACACAGAAGGGAC TT T TG 
GCC T GGAT T GGGAGCCAGAC TC T GGGCAT GT TAGACT CAGACT CT GCAAGAACC TCAGT T ATGAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAAC CCT CAGGT TCTCCC TAGT CAAT GAC T CAGAGGGC T GGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
T ACC T C T G CAC AC CC C GCC AAGAC CAT G GC T T GAT C GT GAG T GGAC C CAGC AAG GACC C CGAT C T 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGT GGAGGCAC CAT CAC CAT CAC CAGGCAT G T C T GC AGAG CCT GGAC AC C AAC T T 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T SLDHALERT YQLLVQVKDMGDQASGHQAT ATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEYNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPI SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI I PVWSHNAQMWQLLVRVI VCRCNVEGQCMRKVGRMKGMPTKLSAVG I LVGTLVAIG I FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T CT TCCCAGGAACACAAACG TAGGAGACCCACGC TCC T GGAAGCACCAGCC T T T A 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCT TGCTTT CCAGCCACAAAGAGACAG ATGA AGAT GCAGAAAGGAAATG 
TTCT CCT TATGT T TGGTC TACTAT T GCAT T TAGAAGC TGCAACAAAT T CCAAT GAGAC TAGCAC C 
TCT GCCAACAC TGGATCCAGT G T GAT CT CCAG TGGAGCCAGCACAGCCACCAAC TCT GGGT CCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAAC T C TGAGTT CCATACAACC TCCAGTGGGATCAGCACAGC CACCAAC TC T GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAAC TCT GAG T CCAGCACACCC TC CAGT GGGGCCAGCACAGTCACCAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGT AGGGC CAGCAC TGCCACCAACT C TGAGTCTAGCACACTCT CCAG T GGGGC CAGCACAGC CAC 
CAAC T CT GAC TCCAGCACAACCTCCAGT GGGGC TAGCACAGCCACCAACT C TGAGT CCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACT C T GAGTCCAGCACAACCT CCAG TGGGGCCAGCACAGC CAC CAACTCT GAGT CCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGC CACCAAC T C TGACT CCAGCACAGTGT CCAGTGGGGCCAGCACTGCCACCAAC TC TGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGC CAC CAAC TCTGACTCCAGCACAACCTCCAGT GGGGC CGGCACAGCCACCAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGTGGGGCCAACACAGCCACCAACT C TGAG TCCAGTACGACCT CCAGT GGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCT CCAGT GGGG TCAGCACAGCCACCAACT CT GAGT CCAGCACAACCT CCAGTGGGGC TAGCACA 
GCCACCAAC TC TGAC TCCAGCACAACCT CCAGT GAGGCCAGCACAGCCAC CAAC TC TGAGT C TAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGC TCT GGAACAGCAGCTC TGACT GGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
T GCTCAT T TAGC TAAGAAATAAAT ACATC T CATC TAACACACACGACAAAGAGAAGC TG TGC T T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CT GGCAT TCAAAATC T CCACAG TAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GAS T VTNS GS S VT S S GAS T ATNS ESS TVS SRAS TATNSE S S T L S S GAS T ATNS DS S T T S S GAS T A 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAG T ATNS ESSTVSSGISTVTNSESSTPSS GANTATN S E S S T T S S GANT ATNS E S S T VS S GAS T A 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTS SEAS TATNSESSTVS SGI STVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNT FNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 



AMEMSGRNSGP 




Signal peptide: 



amino acids 1-20 



Transmembrane domain: 



amino acids 510-532 



o 



ru 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAAC TAT CAGGCTGAAAAACAGAGTGGGTAC TC TC T TCT GGGAAGC TGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AAC GAGGCAGT T TC TAGC TAC T GCACGT ACTTCATAAAGCAGGACT C TAAAAGC T T TGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCA/TCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
O GAT TATAACTGGGGATGCAT TTGTT CCTGGAGAAAGAAGTGTCATTAT CATGAACCATCGGACAA 

J=f GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
7l TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
ip CT T CAT TCAT AGGAAAT GGAAGGAT GACAAGAGCCAT TT C GAAGACAT GAT T GAT TACT TT TGT G 

Ifl ATAT T CACGAACCAC T TCAACT CCTCATAT TCCCAGAAGGGAC T GAT CT CACAGAAAAC AGCAAG 

W TC TCGAAGT AATGCAT T TGCTGAAAAAAAT GGAC T T CAGAAAT ATGAATAT GT T T TACATCCAAG 

* m AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
?J3 ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
Q CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
flJ CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
CI GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 

m 

CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT T TGTACAGTCT T GT TAAGT GGTAT T T TATAAT CACCAT TGTAAT CT TT GTGCT GCAAGAGA 
GAAT AT T T GGTGGACT GGAGAT CATAGAAC TTGCATGT TACCGACT T T TACACAAACAGCCACAT 
T TAAAT T CAAAGAAAAAT GAG TAAG ATT AT AAGG TT TGCCATGT GAAAACCTAGAGCATAT T T TG 
GAAAT GT T C TAAACC TTTC TAAGCTCAGAT GCAT TT T TGCATGACTAT GT CGAATAT T TC T TACT 
GCCATCATTAT TTGT TAAAGATATTT TGCACT TAATT TTGTGGGAAAAATATTGC TACAATT TTT 
TT TAATCTCTGAATGTAAT TTCGATACTGT GTACATAGCAGGGAGTGATCGGGGTGAAATAAC TT 
GGGCCAGAATAT TAT TAAACAAT CAT CAGGCT T T TAAA 
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FIGURE 102 

MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWG5FFG5IFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCIMRYSYLRLEKICLKASLKGVPGFGWi^QAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFS PAMCLLIYLYSLVKWYFIITIVI FVLQERIFGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE IPS 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CAT CCAT CCACCCCT GCT GTCAT CTG TT T TCATAGT GTGAGAT CAAC CCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
* s : AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
% ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
W GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
CI TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
If] AAATCCAAAGGGAAAATCCAGGCGGAAC T GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

Lj AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
?p T TTC T GAT CT GAAAAC T GTAACCCATAGAAAAGC TCCCCAGGAGGT GCCT CACT CTGAGAAGAGA 

Z.! T TT ACAAGGAAGAGT GTGGTGGCTT CT CAGGGT T TCCAAGCAGGGAGACAT TAC TGGGAGGT GGA 

65 CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
||| ATGT GACT TT GTCTCCCAACAAT GGG TAT T GGGTCC TCAGACT GACAACAGAACAT T TGTAT TTC 

s ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 

GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
?% T GACATGTCAGT TTGAAGGCTTGTTGAGACCC TATATCCAGCATGCGATGTATGACGAGGAAAAG 

Up GGGACTCCCATATTCATATGTCCAGTGTCCTGGGG ATGA GACAGAGAAGACCCTGCTTAAAGGGC 
55 CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
f|| CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
"fk TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
If! TCAGAAGCCAT GGCTGCCC TGAAGTGGGGACGGAATAGAC T CACAT TAGGT T TAGT T T G T GAAAA 

m C T CCATCCAGC TAAGCGATCT T GAACAAGT CACAACC TCCCAGGCT CCTCAT T T GCTAGTCACGG 

ACAGTGATTCC TGCC TCACAGGTGAAGATTAAAGAGACAACGAATGTGAATCATGCT TGCAGGTT 
T GAGGGCACAGTGT T TGCTAAT GAT GTGT T T T TAT AT TATACAT T TT CCCACCATAAAC TC TGT T 
T GCT TAT TCCACAT TAAT T TACT TT T CT C TATACCAAAT CACCCATGGAATAGT TAT TGAACACC 
T GCT T TGTGAGGC TCAAAGAATAAAGAGGAGGTAGGATT T T TCACTGAT TC TATAAGCCCAGCAT 
TACC T GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGT CCATATCC 
C T CAT TAAC AC AGACACAAAAAT T C T AAATAAAAT T T TAACAAAT T AAACT AAACAAT ATAT T TA 
AAGAT GATATATAAC TAC T CAGT GT GGTT TGTCCCACAAATGCAGAGT TGGT T TAATAT TTAAAT 
ATCAACCAGTGTAAT TCAGCACAT TAATAAAG TAAAAAAGAAAACCAT AAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVL I LVLS F YELVSGQWQVT GP GKFVQALVGEDAVFS C SL FPET S AEAMEVRFFRNQFHAVVH 

LYRDGEDWESKQMPQYRGRTE FVKDS I AGGRVS LRLKNI T PS D I GLYGCWFS S QI YDEEATWELR 

VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 

IVQENAGSILCSIHLiAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 

SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 

TRKSWASQGFQAGRHYWEVDVGQNVGWWGVCRDDVDRGKNNVTLSPNNGYW 

FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 

TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CC TT CACAGGAC TCTTCATT GCTGGT TGGC AATGA T GT ATCGGC C AGAT GTGGT GAGGGCT AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT TC AT TATGTG AG AT AT AATC AAAAGAAGACC T AC AAT T ACT AT AGC AC ATTGTC AT TT AC AAC 
T GAC AAACT ATATGCT GAGT TTGG C AGAGAGGCTT CTAACAATT TT AC AGAAAT GAGCC AGAGACTTGAAT 
CAAT GGTGAAAAAT GC AT T T T ATAAATCTCC AT T AAGGGAAGAATT T GTC AAGT CT CAGGTT AT C AAGTT C 
AGTC AAC AGAAGCAT GGAGT GTTGGC T CAT AT GCT GTT GATT TGT AGATT T C AC TCT ACT GAGG AT CC T GA 
AACT GT AG AT AAAAT T GT T C AACT TGT T T T ACAT GAAAAGCT GC AAGATGCTGT AG GACC CCCT AAAGT AG 
3 : AT CC T C ACTC AGTT AAAATT AAAAAAAT C AACAAGACAGAAAC AGAC AGCT ATCT AAACC AT T GCT GC GGA 

f=| AC ACGAAG AAGT AAAACT C T AGGT CAGAGT CT CAGGAT CGTT GGTGGGAC AGAAGT AG AAGAGGGT GAAT G 

Q GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
||| TTGT GAGT GCT GCTCACT GT T T TACAAC AT AT AAGAAC CCTGCC AGATGG ACTGCT TCCT TT GGAGT AAC A 

PJ AT AAAACCTT CGAAAATG AAACGGGGTCTCCGGAGAAT AATT GT CC AT GAAAAATAC AAACACCC AT C AC A 

%fl TGACT ATGAT AT TT CT CT T GCAGAGCTT TC TAGC CCTGT TCCCT AC AC AAATGC AGT AC ATAGAGT TT GT C 

m 

TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 

III 

GATGGT TACAGTC AAAAT CATCTT CGAC AAGC AC AG GT GACT CT CAT AGACGCT AC AACT TGC AAT GAACC 

i- 

TC AAGCT T ACAATGAC GCCAT AACTCCT AGAATGT TAT GT GC TGGCTCCT T AGAAG GAAAAACAGATGCAT 
{fa GCCAGGGTGACT CT GG AG GACCACT GGT T AGT TC AG ATGC TAGAGAT ATCTGGT AC C TTGCT GGAAT AGT G 

f-% AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
f|| GAT T ACTTC AAAAACT GGT ATC TAAG AG AC AAAAGCCT CATGGAAC AGAT AACATT T TTTTTTGTTTTTTG 

Q GGT GTGGAGGCCATTT TT AGAG AT AC AGAATTGG AG AAGACT TGCAAAAC AGCT AG AT TT GACTGATCT C A 

fi| ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 

GATC AACTCTGT CAT CTGTGAGCAAT AGT T GAAACT TT ATGT AC AT AGAGAAAT AGATAATACAAT ATT AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAAC AAGGAGTGAAAG AAAAT AT AAGAAGAAAAAAATCCC CT AC AT T TTATTGGCACAGAA 
AAGT AT TAGGTGTTT T TCTT AGTGGAAT AT T AGAAATG AT CAT ATT CAT TAT GAAAGGTC AAGC AAAGAC A 
GC AGAATACCAATC AC TT CATC AT T TAG GAAGT AT G GGAAC T AAGT T AAGGAAGTCCAGAAAGAAGCC AAG 
AT AT AT CC TTAT TTT C ATTT CC AAAC AACT ACT AT GAT AAAT GT GAAGAAGATTCT GTT T TT TTGT GACCT 
AT AAT AATT AT AC AAACT TC AT GC AATGT ACTTGT T CT AAGC AAAT T AAAGC AAAT AT T T AT TTAACATT G 
T TACTGAGGATGTCAACATATAAC AAT AAAAT AT AAAT CACCC A 
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FIGURE 106 



MMYRPDWRARKRVCWEP WVI GLVI FI SL I VLAVC IGLTVHYVRYNQKKT YNYYS TLS FT TDKLY 
AEFGREASNNFTEMSQRLESMYKNAFYKSPLREEFVK^ 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL IDATTCNEPQAYNDAI T PRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 



AGAGAAAGAAGCGT C TCCAGC T GAAGCCAAT GCAGCCC TCCGGC T C T CCGCGAAGAAG TTCCCT G 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT TCT CACT T TAACT GCAGAAACGAGGG TAGAGGAAGCAGT CAT T T TGAC T TAC T T TCC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
G TGT AGAACT GGC T TGTGGCG T T T GGACATATGAACAGGAACT T AT GGT T CCAGTACAAT GGT CA 
GATAT GGTCAC T T T GAAAGCCAGGATGACAAAT TATGGAT TACC TAGATAT CGG TGGC T TAC TCA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
AAAT GAC AGAGAT GGAC T GGC C CC C AGAT T CC T GC T G T G T T AGAG AAT T CC CAGGAT G T T C C AAA 
CAGGCCCACCAGGAAGAT CTCAGT GACC T T TATCAAGAGGGT T GTGGGAAGAAAAT G TAT T CCT T 
T TT GAGAGGAACCAAACAACTGCAGGTGC TGAGGT T T CT GGGAATCT CCAT TGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGAT GTCC T T GAAGAATGACAAC TC TCAGCACC TGT CAT GT CCC T CAGTAGAAC T 
GT TGAAACCAAGCC TGTCAAGAAT CT T T GAACACACATCCAT GGCAAACAGC T T TAATACACACT 
TTGAGATGGAGGAGT TA TAAA AAGAAATGTCACAGAAGAAAACCACAAACTTGT TTTAT TGGACT 
TGT GAAT T T T T GAGTACATACTATGTGT T TCAGAAATATGTAGAAATAAAAATGT TGCCATAAAA 
T AACACC TAAGCATATAC TAT TC TATGCT T TAAAATGAGGATGGAAAAGT T TCATGT CATAAGTC 
ACCACCT GGACAATAAT TGAT GCCCT TAAAAT GC TGAAGACAGATGT CATACCCACTGTGTAGCC 
TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
T CCAT GCAAACGAGT CACATAT GGT GGGAC TGGAGCCATAGTAAAGGT T GATT TACT T CTACCAA 
CT AGTATATAAAGTAC TAAT TAAATGCTAACATAGGAAGT TAGAAAATAC TAATAAC TT TTATTA 
CTCAGCGATCTATTC TTC TGAT GCTAAATAAAT TATATATCAGAAAAC T T T CAATAT TGGT GAC T 
ACCT AAAT GT GATT T T TGC TGGT TAC TAAAAT AT TC T TAC CACT T AAAAGAGCAAGC TAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T T CAGTT CT GATAAT GT TAAGAATAACCAT T ATGAAAAGGAAAATT TGTCCT GTATAGCAT CAT T 
ATT T T TAGCCT TTCC TGT TAATAAAGC T T TAC TATT CTGT CCTGGGC T T AT AT TAC AC AT AT AAC 
T GT TATT TAAATACT TAACCAC TAAT T TT GAAAATTACCAGTGT GATACATAGGAATCAT TATT C 
AGAATGTAGTCTGGTCTTTAGGAAGTAT TAATAAGAAAATT TGCACATAACT TAGTTGATTCAGA 
AAGGACT TGTATGC TG TT T T TC TCCCAAAT GAAGAC TC T T T T TGACAC TAAACAC TT T T TAAAAA 
GCTTATCT TTGCCT TCTCCAAACAAGAAGCAATAGTC TCCAAGT CAATAT AAATTCTACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
CTTTGTTT TATT TCACTGAT TAATATACTGTGGCAAATTACACAGATTATTAAAT TTTT TTACAA 
GAGT ATAGTATATT TATT TGAAAT GGGAAAAGTGCAT TT TACT GT AT T T T GTGTAT T T T GT T TAT 
T T CT CAGAAT ATGGAAAGAAAAT TAAAAT GT GT CAATAAATAT T T TCTAGAGAGTAA 
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MAREDSVKCLRCLLYALNLLFWM 

VMI AVCCFLI I VGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYSQELMVPVQWSDMVT 
LK7VRMTNYGLPRYRWLTHAWNFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE lOQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TC T C GT GAGAGC CT AGAGGC CT T AAAAAAAAAAGT GC T T GAAAGAGAAGGG GAC AAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGT GGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACT GACCACC AGTT GGATGAGTCT C AAGGTCCTCCTCT AT GTGAC AACC AT GT GAAT GGGGAG 
TGGTACCACTT CACGGGCATGGCGGGAGAT GC CAT GCCTACCT TCT GC AT AC C AGAAAAC CACT GTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCTGCTCAGAT ACC AGC GAGT GCACATGCGCTCC AGGAACT GT GCTAGGCCCTGACAGGCAGACAT GCTTT G 
AT GAAAAT GAATGT GAGCAAAACAACGGTGGCT GCAGT GAGAT CTGT GT GAACCTC AAAAACTCCTACCGCTGT 
GAGTGTGGGGTTGGCCGTGTGCTAAGAAGTGATGGCAAGACTTGTGAAGACGTTGAAGGATGCCACAATAACAA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGAT AAC C ACAC T T GCC AAGT CC C T GT GTTGT GCAAATCAAAT GCCAT TGAAGT GAACAT CCCC AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACAT GT GGT AC AGT G GT C GAT GT GGT GAAT G AC AAG AT T GT GGCC AGC AAC C T C GT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGCCGAAATCAT GGGATCT TCCC AT T C ACTCT GGAGAT C TTCAAGGAC AAT GAGTT T GAAGAGC CT T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGT GGAGAGCT GCTTT GCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGT CT AC AGG GCC AGAC GCT AAC AGGC GGC C C GATCC GC AT C GACT GGGAGGAC TAG TT C GT AGCCAT AC CT C 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACT GCTGAACAATGTGGCCT GGGTGGGGTTTCATCTTTCTAGGGTT GAAAACT AAACT GTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT CAGAAGC T GGGT AT AAT AT T T C AAGT T AC AAAC CCT AGAAAAAT T AAAC AGT TACT GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT AT AGG GAAT TTGGAAGT GT AT C AAT AAAACAGT AT AT AAT T T T 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 
amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93 f 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites, 
amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy lati on site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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G AGAGAGGC AGC AGCT TGCT C AGCGG AC AAGG AT GC TGGGCGTG AGGG ACCAAGGCCT GC CCT GCACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CC TGCT GCCT TGGG GT GAC AATCT CAGCT C C AGGCT AC AGGG AG ACCGGGAGGAT C ACAG AGCC AGC ATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GGAG ACCTTC AG AAAGGT GGGG ATCCCC AT C ATC AT AGC ACT ACTGAGCCT GGCGAGT AT C ATCAT TGT 
GGTT GT CCT C AT CAAGGT GATTCT GGAT AAAT ACT ACT TCCT CT GCGGGC AGCCTCT CCACT TC AT CC CG A 
GG AAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
2^ CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
Q AGGGAACT GGTT CT CT GCCT GT TT CGACAACT TC AC AG AAGCTC TCGCTGAG AC AGCCT GTAGGC AGATGG 

Q GCT ACAGCAGAGCT GT GGAG AT TGGC CCAGACC AGG AT CTGGAT GTTGTT GAAATCAC AG AAAACAGC CAG 

ffl GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
W TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T CAGCATCCAGT ACGAC AAAC AGC ACGT CT GT GGAGGG AGCAT CCTGGACCCCC AC TGGGT C CTCACGGC A 

#fii 

Y** GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGT GCGGGC AGGCTCAGACAAACTGGGC AG 

III 

CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGAC A 

1,. TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 

1*1 

5 55 GATG AG GAGCTC ACT C CAGCC ACCCC ACTCTGGAT CAT TGGAT GGGGCTT TACGAAGCAGAATGGAGGGAA 

M I 

£*'| G ATG TC TGAC AT ACT GCT GC AGGCGTCAGT CC AGGTCAT T GAC AGCAC ACGGTGCAATGC AGACGATGCGT 

f|| ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
f!| GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
fll CTGCGGGGGCCC GAGC AC CCCAGGAGTAT AC ACC AAGGTCTC AGCCT ATCT CAACT GGAT CT ACAATG TCT 

GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG AT CCCC CAAAGT CAGACACAGAGCAAGAGT CC CCTT GGGT AC ACCC CTCTGCCC AC AG CCTC AG CAT 
T TCT TGGAGC AGCAAAGGGCCT CAATTCCTGT AAGAGACCCT CGCAGCCC AG AG GCGCCC AGAGGAAGTC A 
GC AGCCCT AGCTCGGCCACACT TGGTGCT C CC AGCATC CC AGGGAG AGAC AC AGCCCACT GAAC AAGGT CT 
CAGGGGTAT T GCTAAGCC AAGAAGGAACT T TCCC AC AC TACTGAATGGAAGC AGGCTGTCTT GTAAAAGCC 
CAGATC ACT GTG GGCT GGAGAGGAGAAGGAAAGGGT CT GC GCCAGCCCTGTCCGTCTTCACCCATCCC CAA 
GCCT AC T AGAGC AAGAAACC AGTT GT AATAT AAAAT GC AC TGCCCT ACTGTTGGTAT GAC TACCGT TACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 

MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVGIPI I IALLSLAS 1 1 1 VWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEAIJIETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS ILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMS DI LLQAS VQVI DS TRCNADDAYQGE VTEKMMCAG I PEGGVDTCQGDS GGPLMYQS 
s . DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Q Transmembrane domain: 

V 3 amino acids 32-53 (typell) 
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FIGURE lis 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 

TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 

TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 

TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 

GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 

CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 

GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 

CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 

AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 

GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 

ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 

CAGAAGGGGTC TGC T TAGACCACCT GGT T TAT GTGACAGGACT TGCATTCTCC T GGAACATGAGG 

GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 

GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 

CCACGTACCAGC TGC TGAAGGGCAACTGCAGGCCGAT GC T C T CATCAGCCAGGCAGCAGCCAAAA 

TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 

CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 

GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 

CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 

CAAATGAT CT CCAAGGGCCCT T ATACCCCAGGAGACT TT GATT T GAATT T GAAACCCCAAAT CCA 

AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 

T T T GGGAGGCC GAGGCGGGTAGATCACC T GAGGTCAGGAGT TCAAGACCAGCCT GGCCAACAT GG 

TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 

CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 

GGAGAAT CACT T CAGCCTGAGCAACACAGCGAGACT C TGT C TCAGAAAAAATAAAAAAAGAAT T A 

TGGTTATTTGTAA 



FIGURE lid 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 



u 

Q 
Q 

m 
u 
in 
m 

la 

O 
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ro 
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FIGURE lis 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATCGCAA 
AGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GT T T TGGGGGAGCAAGCACTTC TGGCCGGAGGTACCCAAAAAAGCCT ATGACAT GGAGCACAC T T 
TC T ACAGCAAT GGAGAGAAGAAGAAGAT T TAC AT GGAAAT T GAT CC T GT GACCAGAAC T GAAAT A 
T T CAGAAGCGGAAAT GGC AC TGATG AAAC AT T GGAAGTGCAC GACT T T AAAAACGGAT ACAC TGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CT GAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACT TT CT TTGAACAGTCAGTG 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAA7VATTCCAAAATTCTGGA 
GAT TT GT GATAACGT GACCATGT AT TGGATCAATCCCACTCTAATAT CAGT T TCTGAG T TACAAG 
ACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAACTTCCAATAAATGACTATACTGAAAATGGAATAGAAT TT GATCCCATGCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TATCACCAAAAAAAAAAAAAAAAAAA 



FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSE PEEE I DENEE I T TT FFEQSVI WVPAEKP I ENRDFLKNSKI LE I CDNVTMYW INPTL I S VSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CC GT GT TCC AGT AC GAAGGGCT CT GG AGGAGC TGCGT GAG GCAGAGTTC AGGCT TC AC CGAATGC AGGCCC 
T AT TT C ACCATCCT GGGACT TCCAGC C AT GCT GC AGGC AGTGCG AGCCCT GAT GAT CGT AGGCAT CGT CCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
lj TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
f% GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
|3 TCAC AC T AAT TGGGGGT GTG ATGATGT GC ATCGCCT GCCGGGGC CT GGCACC AG AAGAAACC AACT AC AAA 

111 GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
III TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
111 CT T C CAAGC ACG AC T ATGTGTAATGC TCT AAGACCT CT CAGCAC GGGCGG AAGAAACT CC CGGAGAGCTC A 

W« CCC AAAAAACAAGGAGAT CCCATCT AGATT TCT TCT TGCTT T TGACTCAC AGCT GG AAGT TAGAAAAGCCT 

W CG AT TT C AT CTTTGGAGAGGCC AAAT GGT C TT AGCCTC AGTCTCT GTCTC TAAATATTCC AC C ATAAAAC A 

%„ GCTGAGTT AT TT AT GAAT T AGAGGCT AT AGCT CAC ATT T TC AAT CCTCT ATTTC TT T TTT T AAATAT AACT 

?J« TTCT ACTCTG AT GAGAGAAT GT GGTTTT AAT C TCTCTC TC ACAT TTTG AT GATT TAGAC AGACTCCCC CT C 

f4 T TCCTC CT AGTC AAT AAACCCATT GATGAT CT AT TT CCC AGCTT ATCCCC AAGAAAACTT TT GAAAGGAAA 

f|| GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
Q ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
f|| ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGAC ATCT T CT T ATT ACAGCAAC ACCATT CT AGGAGT TT CCTGAGCT CTCC AC TGGAGT CCTCTT TCTGT 
CGCGGGTCAG AAATT GTC CCTAGAT GAATGAGAAAATT AT TT TT TT T AAT TT AAGT CCT AAATAT AGTTAA 
AAT AAAT AATGT TT T AGT AAAATGAT AC ACT AT C TCTGTG AAAT AGCCTC ACCCCT ACAT GT GGAT AGAAG 
GAAATGAAAAAATAAT TGCTT T GACATT GT CT AT AT GGT ACT T T GT AAAGTC AT GCT T AAGT AC AAAT T CC 
AT GAAAAGCT CACACC TGT AAT CCTAGC ACTTTGGGAGGCTGAGGAGGAAGG AT CACT TG AGCC CAGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TG GGGC T GC AGT GAGC CAT GAT CAC ACC ACTGC ACT CCAGCCAGGT GACATAGCGAGATC CTGTCT AAAAA 
AAT AAAAAAT AAAT AATGGAACACAGCAAG TCCT AGGAAGTAGGT T AAAACT AAT TCT TT AA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAA.TGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLW^ 

CAI AGVS VFANMLVTNFWMS TANMYT GMGGMVQTVQTRYT FGAAL FVGWVAGGL TL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCC CAGAGGATAATGGCAACCCATGCCT T AGAAAT CGCT GGGCT GTTTCTT G 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGT GCACGGGGGACAATGAGAAGGT GAAGGCTCACAT TCTG CTGACGGCTGGAATCATCT T CAT 
jU CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
Cl ACT CAATAGTGAATGTT GCCCAAAAACGT GAGCTT GGAGAAGCTCTCT ACT TAGGATGGACCACGGCA 

|*| CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
fl CAGATACTCGAT ACCT T CCCAT CGCACAACCCAAAAAAGTTAT CACACCGGAAAGAAGTCACCGAGCG 

TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ill; ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
I|| AATCTTAATTACAGGAACT GTGCATCAGCTATTTAT GATTCTATAAGCTAT TT CAGCAGAATGAGAT A 

5 TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
I J CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
2| ACGACATAGCAT TATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACAT GCTTAT AT GGT 

S! TTTATTTAAAAT GAAATGCCAGTCCAT TACACT GAATAAATAGAACTCAAC TATT GCTTTTCAGGGAA 

=P1i ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
f|l TTTATAAT GAAGATT AAAAT GAAGGCTTTAAT CAGCATT GTAAAGGAAATT GAATGGCTT TCTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATT TTTGTTTT T GTATTTGAAGAAGAATGATGCATTTT GACAAGAAATCATATAT GTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATAT TAAT AAATTGTACATTTTT CTAATT 
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FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
IYDSLLALSPDLQAARGLMCAASV^^ 

MWLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT TCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCT T TGAGGAGTCCTGGACACCCAAC TAC 
I** AAGCAG T GT T CATGGAGT TCAT TGAAT TATGGCATAGATC TT GGGAAAAT T GCGGAGT GTACAT T 

pi TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
ffl GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
W AT T GAAGCTATAAT T TAT T TGGACCAAGGAAGCCCTGAAATGAATT CAACAAT TAATATTCATCG 

^ CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
71 TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TAT T GAAGAACTACCAAAATAAAT GCT T TAAT T T TCAT T TGCTACCT C T T T T T T TATTAT GCC 

••5? ~ — 

Q T T GGAAT GGT T CAC T TAAATGACAT TT TAAATAAGT T TATGTATACATCTGAAT GAAAAGCAAAG 

If! CTAAATAT GT T TACAGAC CAAAGTGTGAT T T CACACT GT T T TTAAAT C T AGCATTAT TCAT T TTG 

CTT CAAT CAAAAGTGGT T TCAATAT T T T T T T TAGT TGGT TAGAATACT TT CT TCATAGTCACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
Hi AAAAAATAT AAAAGCT ACCAAT CT T T GTACAAT TT GT AAAT GT TAAGAAT TT T T T TTATAT C TGT 

T AAAT AAAAAT TAT TTCCAACA 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCC T GGAAGAATACATCAT GT T T T T C GATAAGAAGAAATT GTAGGATCCAGT T TTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACC T T TGAACAATACAAAGGATGGGT T T CAAT G 
T AAT TAGGC TAC T GAGCGGATCAGC T GTAGCAC TGGT TAT AGC CCCCACT G TC T TAC T GACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
§»i ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
f J AT AACAG CCT T C AAAAAC T T AAG TAT AAT CAAT T T AAAGG GC T C AAC CAG C T CAC C T GG C TAT AC 

1% CTT GACCATAACCATAT CAGCAATAT T GACGAAAATGCT T T TAAT GGAATACGCAGAC TCAAAGA 

j;f GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
P TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
W CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
If! AGACT GCCGCAACC T GGAACT T T T GGACCT GGGATATAACCGGATCCGAAGTT TAGC CAGGAAT G 

%l TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACC TGGAGCACAAT CAAT TT TCCAAGCTCAAC 

Yl CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
W CATAGGACAGACCAT GTCC TGGACCT GGAGCT CCT TACAAAGGCT T GAT T TAT CAGG CAAT GAGA 

a TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
|:| TCCAACAAGCTC ACAT T TATTGGTCAAGAGAT T TTGGAT TC TT GGATATCCC TCAATGACATCAG 

Tii TCT T GCTGGGAATATATGGGAAT GCAGCAGAAATAT T TGC TCCC T T GT AAAC TGGCTGAAAAGT T 

J| T T AAAGGT CT AAGGGAGAATACAAT TAT C T GTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGT G 

AT C GAT G CAG T GAAGAAC T ACAGC AT C T G T GGCAAAAG TAC T AC AGAGAG G T T T GAT C T GGC CAG 
Til GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
13 TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
m TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
l5# CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAGTCCCT AAAGCAAATGAC T CCCAGCACCCAGGAAT T T TATGTA 
GAT T ATAAACCCACCAACACGGAGACCAGCGAGATGC TGCTGAAT GG GACGGGACCCTGCACC TA 
TAACAAAT CGGGCTCCAGGGAGT GT GAGGTATGAACCAT T G TGATAAAAAGAGC TC TTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATT T TCC TC TCATACATAATCAACCCAT T GAAAT T TAAATACCACAAT CAAT GTGAAGC T T 
GAACT CCGGTT TAATATAATACC TAT T GT ATAAGACCC TT TACT GAT T CCAT TAATGTCGCAT TT 
GT T T T AAGATAAAAC T TC T T TCATAGGT AAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNS LQKLKYNQFKGLNQLT WLYLDHNHI SN I DENAFNG I RRLKEL I LS SNRI S YFLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARWFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS ICGKS T TER FDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cQtiP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAT^ACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCAT TATCCCC T T TGGAAAT CAGT CAT TGGAGGGAT GAT GGCT GGT GT TATT GGCCAGT T T 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCAT T GCGAT T TCGT GGT GTACAT CAT GCAT TTGCAAAAAT CT TAGCTGAAGGAGGAATAC 
GAG GGCT TTGGGCAGGCT GGGTACC CAATAT ACAAAGAGCAGCACT GGT GAATATGGGAGAT T T A 
ACCACT T AT GATACAGTGAAACAC TACT T GGTAT T GAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACT T T T GTATAAAT CATCG 
AC TGACTGCT T GATTCAGGCTGT T CAAGGT GAAGGAT TCATGAGT CT ATAT AAAGGC T TT T TACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
T GAGT GGAGTCAGT CCAT T T TAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
AP YRGMVRT ALG IIEEEG FLKLWQGVT PAI YRHWYS GGRMVT YEHLRE WFGKSEDEHYPLWKS 
VIGGMMAGVIGQFIANPTDLVKVQMQMEGKRKLEGKPLRFRGTOHAFAKII^GGIRGLWAGWVP 
N I QRAALVNMGDLT T YDTVKHYLVLNTPLEDN IMT HGL S S LCS GLVAS I LGT PADVI KS R IMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
M GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
y AGGAGC GGAAT GTGGACGATGGGCACAT CAT TAACATCAATAGCAT G T CT GGCCACCGAGTGT TA 

]^ CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
10 GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
ill AGACACAAT T CGCC T TCAAACT CCACGACAAGGACCC TGAGAAGGCAGCT GCCACCTATGAGCAA 

& ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
* w CAT CCAGAT T GGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGT GAC T GTGGGAGC TCC 

*«l TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
If! CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 

TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

if! TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

III 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCC T GGCCCAG TGGATT T CATGGT GATCAT TAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 



riJ 
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FIGURE 128 

MARPGMERWRDRLALVT GAS GG I GAAVARALVQQGLKWGCARTVGN I EE LAAE CKS AG Y PGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHI IN INSMSGHRVLPLSVTHFYSATKYAVTALTE GLRQELREAQTH I RATC 
I S PGWE TQFAFKLHDKDPEKAAAT YE QMKCLKPEDVAEAVI YVLS TPAH I Q I GDI QMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
C CG T T GAGTGCT TGAA ATGA GGAAC T GAGAAAAT TAAT T T C TCATGTATT T T TC T CAT T TAT TTA 
T TAAT TTT TAAC TGATAGTTGTACATATTT GGGGGTACATGTGATATT TGGATACATGTATACAA 
T AT ATAAT GATCAAATCAGGGT AACT GGGATATC CAT CACATCAAACAT TT AT T TT T TAT TC TT T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAAT T T TTG TAT TTT TAGTAGAGACGGGGT T TT GCCAT GT TGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCT C TGATCACCTCAT TC TAC TC T CT ACCT CCATGAGATCCAC TT T T T TAGCT CCCACATGT G 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T T CCATCCATGT TGC TGCAAAT GACAGGAT TTCGTTCT TAAT T T CAAT TAAAATAACCACACATG 
GCAAAAA 



FIGURE 130 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGE S PMPFKRVFCQDGNVRS FCVCAVHFS S HQP PVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 121 



ii 



TT CTGAAGTAACGG AAGCT ACCT T GT AT AAAG AC CT C AAC AC T G CT GACC ATGATC AGCGC AGC CTGG AG C 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC CAGAGG AT GCT ACAACTCTCT AC C TT C AGAACAACC AAAT AAAT AAT GCT GGGATT CCTTC AGAT 
TT GAAAAACTT GCT GAAAGT AGAAAG AAT ATACCT AT ACC AC AAC AGT TT AGAT GAAT T TCCTACCAACCT 
CC C AAAGT AT GT AAAAGAGT T ACATT TGC AAG AAAAT AAC AT AAGG AC T ATC AC TT AT GATT CACTT T CAA 
AAAT TC CCT ATC TG GAAGAATT AC AT TT AG AT GACAAC T CTGTCTCT GC AGT T AGC AT AGAAGAGGG AGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GC CC AGGACT AT AGAAGAACT ACGCT TGGATGAT AATCGC AT AT CC ACT AT T TC AT CACC AT CT CTTC AAG 
GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCT TT ATCT TCAAGATAACCACAT C AATCGGGT GC CCCC AAAT GCTT TT T 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATGAT TT GGAC AATAT AAC AC AACT GATT CT T CGC AAC AAT CC CTGGTATTGCGGGT GC AAG ATGAAAT G 
GGTAC GTGACTGGT T ACAAT CAC T ACCTGTGAAGGT CAACGTGCGT GGGC TCAT GTGC C AAGCC CCAG AAA 
1 AGGT T C GTGGGATGGC TATT AAGGATCTC AAT GCAGAACTGTTT GAT T GT AAGGAC AGT G GGAT TGT AAGC 

ACC AT T CAGATAACCACT GC AAT ACC CAAC AC AGTGTAT CC T GCCC AAGGAC AGTGGCCAGCTC CAGT GAC 

111 

CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
-g]; CAAT T AC AAT TACT GT GAAGTCTGTC AC CTCTGAT ACCATT C AT ATCT CTTGG AAACTTGCT CT ACC T ATG 

ACTGCT TT GAGACT CAGCT GGCTT AAAC TG GGCC AT AGCCCGGC ATTT GGATCT AT AACAGAAACAAT TGT 

\-J 

f\'i AACAGGGGAACGCAGT GAGT ACTTGGTC AC AGCC CT GGAGCCTGATTCACCCT AT AAAGT AT GCAT GG TT C 

CC AT GGAAACCAGCAACCTCT ACCT ATT T GAT GAAACTCCTGTT T GTATT GAG ACT GAAACT GC ACCC CTT 
CG AATGTAC AAC CC TACAAC CACCCT CAAT CGAGAGC AAG AGAAAGAACCTT AC AAAAACCCC AAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGC ACT AAG AAGG AC AACT CT AT CC TGGAAATCAGGGAAACTTCT TT T C AGAT GTT ACCAAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
AC AAAAAC AATC AC AGTGAAAGCAGT AGT AACCG AAGCT AC AGAGACAGT GGT ATT CCAGACTC AGAT CAC 
TC AC ACTC ATGATGCT GAAGGACT CAC AGC AG AC T T GT GTT T TGGGTT TT TT AAACCT AAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVS AVS I EEGAFRDSNYLRLL FLS RNHLS T I PWGLPRT I EELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVOTRGMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPIJ^IIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myri s toylati on si tes . 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Ami da ti on site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 122 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACT ATAT CAACC TCT TC TGCGCCAACATGAT GGACT T GAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGAT GCTTCGGGGAGCCT GATGC TGAAGAT GAAGAAT TAT CT AAAGC TAT TCAA 
TATCAGCAGCAT TT T T CGAGGAGAGT GAAGAGGCGAGAAAAACAAT T T CCAGAT TCTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
T T TTATAT T TT T T TAGTAGAGACAGGG TT T CATCAT GT TGC TCATGCT GGT CTCGAACT CC T GAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCAC T T TCAAAGAGACT GT GT T GAATAAAGGGCCAAGGT T CT T GCCACCCAGCAC TC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCC GCT GAGACGGACGGT TCCAT GCCAGCTGCCTGGAGGAGGAACAGAC CCC 
T T TAG T CCT CAT CCC T TAGATCC T GGAGGGCACGGATCACAT CCT GGGAAGAAGGCATC TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 



FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWABMRGCTQGPLQQSQD 
Y INLFCANMMDLNRRAEAI GYAYPTRD I FMENIMFCGMGGFS D FYKLRWLEAI LS WQKQQEGC FG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
S S WDYRSVPPYLANFY I FLVETGFHHVAHAGLELL I SRDPP TS GS QS VGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGAT TTGATCCCGTTCGAGTGGATATCACT TCGAAAGGAAAAATGAGAGCAAGATAT G 
T GAAT TACAT CAAAACATCAGAGGT T GT CAGAC TGCCC TATCCTCTCCAAATGAAAT C T TCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACAT GAGACGGGAAATGGAGCAGTCAATGAATATGCTGAAT TCCAACCATGAGTTGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
If! TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
IP AAACACGGCAACAC TGGGTGGCAT CCAAG T CT TGGAAAACCGTGTGAAGCAAC TACTATAAACT T 

W GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

p. AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 

is J 

Pfl GAGGT CAATAT TGATGT CACTGAAT TAAT TACAGT GTCC TATAGAAAAT GCCAT TAATAAAT TAT 

p ATGAACTACTATACATTATGTATAT TAAT TAAAACATCT TAATCCAGAAATCAAAAAAAAAAAAA 

fll AAAAAAAAAAAAAAA 



III 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PSGS YWEWS PAYRFDPVRVDI TSKGKMRARYVNYIKTSE 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFIJtfNPMVM^ 
QSM3SIMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site* 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGT GTGAT CGCCACAAACCT T CAGGAAATAC GAAAT GGAT T T T CT GAGATACGGGGCAGT G 
TGCAAGCCAAAGATGGAAACAT T GAGAT CAGAATC T TAAGGAGGAC T GAGT CT T T GCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAAC T ACCAGACCCC TGACCAT T ATACT CT CCGGAAGAT CAGCAGCCTCGCCAAT TCC TT T C TTA 
C CAT CAAGAAG GAC C T CC G GC T C T C T CAT G CC C AC AT GACAT G C CAT T GT GGGGAGGAAGCAAT G 
AAGAAATACAGCCAGAT TCTGAGTCACT T T GAAAAGCTGGAACCTCAGGCAGCAGT TG TGAAGGC 
T T T G G GG GAAC T AGAC AT T C T T C T G CAAT GG AT GGAG GAGACAGA ATAGG AG GAAAG T GAT GC T G 
CT GC T AAGAATAT TCGAGGTCAAGAGCT CCAGTC TT CAATACCT GCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTT TC TGC TAT TGGATATAT T TAT TAGT TAATATATT TAT TT AT T T T TT GC TATT TA 
ATGTATTTATT TTTT TACTTGGACATGAAACTT TAAAAAAATTCACAGATTATATTTATAACCTG 
ACTAGAGCAGGTGATGTAT T T T T ATACAGTAAAAAAAAAAAAC CT T GT AAAT TC TAGAAGAGTGG 
C TAGGGGGGT TATTCATT TGTAT TCAACTAAGGACATAT T TACT CAT GCT GATGC TC TGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CC AGGAAT CCTACACGGCCAGCAT GTAT T T CTACAAATAAAGTT TTCT TT GCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 128 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
S SREAWQFFLLLWS PDFRPKMKAS S LAFS LLSAAFYLLWT PS TGLKTLNLGSCVI ATNLQE IRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWM 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 




N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 



amino acids 192-195, 225-228 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAZ^AGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
T AAGACAAGCAAAAGCAGCTGGAAAAAGAATAG TCT T T TATGGAGAT GAAACCT GGGT TAAAT TA 
T TCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCAT TTT TCGTGTCAGAT TACACAGAGGT 
. , GGATAATAAT GTCAC GAGGCAT T T GGATAAAGTAT TAAAAAGAGGAGAT T GGGACATATTAATCC 

l2 TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
P AAGCTGAGCGAGAT GGACAGCGTGCTGATGAAGATCCACACCT CACT GCAG TCGAAGGAGAGAGA 

O GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
ffl GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
*]i CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAG ACGGATGTGGCTGCGACACTGGCGATAGC 
ft ACT TGGC T T ACCGAT TCCAAAAGACAGTGTAGGGAGCCTCC TAT T CCCAGTT GTGGAAGGAAGAC 

y CAATGAGAGAGCAG T TGAGAT T TT TACAT T TGAATACAGT GCAGCT TAGTAAAC TGT TGCAAGAG 

ff! AAT GT GCCGTCATAT GAAAAAGATCCT GGGT T TGAGCAGT TTAAAATGTCAGAAAGAT T GCATGG 

III GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
I " T TCT CAGGCAGTACCT GGATGCTCT GAAGACGC T GAGC TTGTCCCT GAGT GCACAAGT GGCCCAG 

TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
W CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
111 CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
Q GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
||J AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 

TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
|J CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
III TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CACT CAGGAGGGTCAAAAGGAGACTTGGTCGCAC CACT CAT CCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGAT CCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACACTCACAGCTT T GT CATCAGGGCACAGGCT T C CT CGGAGCCAGGATGATCT GT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CT GCACACAGTATGTAG T TACCAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLG S GT FAT CCVAI E VLG I AVFLRG FFPAPVRS S ARAEHGAE PPAPEPSAGAS SNWTTLPPPLF 
SKWIVLIDALRDDF^FGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTOPRIKAIMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site, 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTT CCAGATAAAAAC TACATACGCCCAGAGATC T TC T TT GCAT T AGCCTCATC C T TGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T GT GACAAGGATAAAGGACAAAGTCAT CCAT CCC T TCAGC T GAAGAAGGAGAAAC TGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGT GACAGATAAAT T TGAGAACAGGAAACACAT T GAAT T T TCATT TCAACCAGT TTGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAGG AAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT TT GAAC TAAT TGTATAAAAACACCAAACCTGC TCAC T 



111 

O 
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FIGURE 142 



MLLLLLE YNFP I ENNCQHLKT THT FRVKNLNPKKFS IHDQDHKVLVLDSGNL I AVPDKNY IRPE I 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 




Interleukin-1 



amino acids 37-182 



in 
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FIGURE 143 



C TAGAGAGTATAGGGCAGAAGGATGGC AGAT GAGTGAC T CCACAT CC AGAGC T GCC T CCC T TT AA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCAT TTCCACGAGCAAAGGGAC T GT GATGAACACAAT GT CAT GGCT C GT TACC T CCC T GCCA 
CAGT GGAGTT TGCTGTCCACACAT TCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
Q GGGGAGAACTAGGTGTGGGAAATTT GAAGACGACAT TGACAACTGCCAT TTCCAAGAAAGCACAG 

?J AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
C TC CT GAACAAGACC TGC T TGGAGGGAT T CCAC TGAGTGAAACCCAC TCACAGGC T T GTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
ffl T TGGACT TGT T TGT TAT CCTAT TT T GCAT GTGT TT GAGATCTCAGAT CAGT GT T T TAGAAAAT CC 

ly ACACATCTTGAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCAT 
31 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



3 S« : 

5 

111 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQI LL I YAWH FHEQRDCDEHNVMARYLPAT VE FAVHT FNQQS KDY 
YAYRLGH I LNS WKEQVE S KT VFSMELLLGRTRCGKFEDDI DNCHFQES TELNNT FTC FFT I S TRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
| s l CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
O TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAM 
fc* AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
|^ CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
pf| TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
£f! CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
ly GAGAAGAACACATCAGGCACTGCGCCACC TGC T TCACAGTACT T CCCAACAAC TC TTAGAGGTAG 

* G TGTATT CCCGT TT TACAGATAAGGAAACTGAGGCCCAGAGAGC T GAAGTACT GCACCCAGCAT C 

ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
f ? | GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
f|| CTCTAATGAAAT TGTGAAAGCT CCAT GT TTAGAAATAAAT GAAAACACCT GA 

u 



S if 
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FIGURE 146 

MRKHLSWWWIATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 



Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 

Iff N-glycosylation sites. 

'4* amino acids 98-102, 110-114 

w 

^ Tyrosine kinase phosphorylation site. 

I??; amino acids 76-83 

W N-myristoylation sites, 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



Amidation site. 

amino acids 62-66 
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FIGURE 147 



Q 

m 
la 
in 

m 
hi 

5 

Q 
II! 
O 

ru 
o 
ru 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACT TT CAGATTCAGGGGGTACAT GTGAAGGT T TGT T T TATGAGTATAT T GCATGAT GC TGAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATT AAGTGCAGAAAGACCTT TGAAT GAACAGATTGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
!T TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 
S% ATC GAT GAT TAT GACT CT AC T AAGAGT GGAT T G GAT C AT AAAT T T CAAGAT GAT C C AGAT GGT C T T CA 

m T CAACTAGACGGGACTCCTT TAACCGCTGAAGACAT TGT CCATAAAATC GCT GCCAGGATT TATGAAG 

||| AAAATGACAGAGC CGTGTT T GACAAGATT GTTTCTAAACTACT TAATC TCGGCCTTATCACAGAA&GC 

111 CAAGCACATACACTGGAAGATGAAGTAGCAGAGGT TT TACAAAAATTAATCT CAAAGGAAGCCAACAA 

f I TTATGAGGAGGAT CCCAATAAGCCCACAAGCT GGACT GAGAAT CAGGC TGGAAAAATACCAGAGAAAG 

W TGAC TCCAATGGCAGCAAT T CAAGATGGT CTT GCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 

f** TTAACCTT GACAAATGGCT T GGAAAGGAGAACTAAAACCTACAGT GAAGACAACT TTGAGGAACT CCA 

l n ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAA.GAAGCAAAAGAGAAAGAAA 
Q CACT GAT T ACTAT CAT GAAAACAC TGAT T GAC T T T GT GAAGAT GAT GG T GAAAT AT GGAACAAT AT CT 

PJ CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
W AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
1 y CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 

GAT GATAACTCCAACCCAGGAGGAAAGACAGAT GAACCCAAAGGAAAAACAGAAGCCTAT TTGGAAGC 
CAT CAGAAAAAATATTGAAT GGTT GAAGAAACATGACAAAAkGGGAAATAAAGAAGATTAT GACCTT T 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGC CATCAAGCGCAT T TATAGCAGCCT GTAAAAAT GGCAAAAGAT CCAGGAGT CTTT CAA 
CTGT TTCAGAAAACATAATATAGCTTAAAACACTT CT AATTCT GT GAT TAAAATTT TTTGACCCAAGG 
GTTAT TAGAAAGTGCTGAAT TTACAGTAGTTAACCT TTTACAAGT GGT TAAAACATAGCT TTCTT CCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE iso 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAET^EEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDS TKNRKLI DDYDS TKS GLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS IDSEKEAKEKETLI T IMKTLI DFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 
220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-rayristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAAT AACCAGCTT C TAGC TGGAGGGCTGCAT GCAGGGAAGGT CATTAAAGGT GAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGAC TAGG GCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGT GGAGGAGACC CATGGCGGACAAT CACT CTCTCTGC T C TCAGGACCCC CACGTC TGAC T TAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGT CAGTAGC T CT CCACATGAAGT CCT GT CAC TCAC CAC T GT GCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGC TCGAGCTCAGAAGATAAAAGAT AAGTAGGGTATGCT GATCCTCT TT TAAAAACCCAA 
GATACAAT CAAAAT CCCAGAT GCTGGT CT C TAT TCCCAT GAAAAAGT GCT CATGACATAT T GAGA 
AGACCTACTTACAAAGTGGCATATAT TGCAAT TTATTT TAATTAAAAGATACCTATTTATATATT 
TCTT TATAGAAAAAAGTCTGGAAGAGTT TACT TCAATTGTAGCAAT GT CAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TC CT T GTATAAAAATAAGAAAAGAAATTAATC TTGAGG TAAGCAGAGCAGACATCATC TC TGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T T GT AGTAGTGATCAGGAAACAGAT CTCAGCAAAGCCACTGAGGAGGAGGC T GT GCTGAGTT TGT 
GTGGCTGGAATCTCTGGGTAAGGT^ACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GAC T GGT T TCC TTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGT A 
AAGAT GAAGGCAGAGATCGGAGT T T T GCAGCCACAAGC TAAGAAACACCAAGGAT TGTGGCAACC 
ATCAGAAGC T TGGAAGAGGCAAAGAAGAAT TC TTCCCTAGAGGC T T TAGAGGGATAAC GGCTC T G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT TGGT TACAGCAGCT CTAGGAAAC TAATACAGC T GC TAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
T AGAATAT GGCAGAAGTGAT GGCATGCCACTTCCAAGAT TAGGT TATAAAAGACACT GCAGC T TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAG T T G C T CAG T T T T GG T C T AAC T T G T TAT GCAGCAATAGAT AAAT AATAT G CAGAGAAAGAG 



APP_ID= 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLIAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKS FTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWKAP I TDFYFQQCD 



N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 



Interleukin-1 signature. 

amino acids 111-131 



Interleukin-1 proteins. 

Ill 

|f| amino acids 8-29, 83-120, 95-134, 64-103 

Si 
Ml 



•2 SZ. 
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FIGURE 15 3 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGC AATGG CCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
hh TATT GAAGGT GATGACCT GCAT AT CCAGAGGAAT GT GCAAAAGCT GAAGGACACAGTGAAAAAGC 

13 T TGGAGAGAGTGGAGAGATCAAAGCAAT T GGAGAAC TGGAT TT GC T G T T TAT GT C TC TGAGAAAT 

y GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
71 CAAT TAGAT GCCCCAAAGCGAT TT TT TT TAACCAAAAGGAAGAT GGGAAGCCAAACTCCATCAT G 

yij ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
ff! AGACCAGAAGGTAGACTTTCTAAGCATAGATATT TATTGATAACATT TCAT TGTAACTGGTGTTC 

iM TATACACAGAAAACAATT TATT TT TT AAATAAT T GTC TTT T TCCATAAAAAAGAT TAC T T TCCAT 

* gju TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
l^j TGTATTTATTATTATTATAAGACTGCATTTTATT TATATCATT ^TAT TAATATGGATT TATT TAT 

f% AGAAACATCAT TCGATAT TGCTACTT GAGTGTAAGGC TAATATTGATATTTATGACAATAATTAT 

f|| AGAGC T ATAACATGT T TAT T TGACCT CAATAAACAC T TGGATAT CCC 

ru 
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FIGURE 154 



MAALQKS VS S FLMGTLAT S CLLLLALLVQGGAAAP I S SHCRLDKSNFQQP Y I TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIBCAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 




N-glycosylation sites. 



amino acids 54-58, 68-72, 97-101 



N-myristoylation sites. 




amino acids 14-20, 82-88 




Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 10-21 



ru 
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FIGURE iss 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
G C T G C T C TAC C AC AACC AGAC T G T C T T C T AC AGGC GGCCAT GC CAT G G CGAG AAGGGCAC C CACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAGC CGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATCTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGAC T TGAGGGACCT CCGAGT AGAACCTGT TACAAC TAGT GT T GCAAC AGGGGAC TAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT T T GT GT GACGGGCAAAAGCAACTT CCAGTCC TACAGC T GT GT GAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAGTC TAT T TCAT TGGGGCCCATAATAT T CCTAAT GCAAAT AT GAAT GAAGATGG 
CCCT T CCAT GT C TGTGAAT T TCACCT CACCAGGC TGCC TAGACCACATAAT GAAATATAAAAAAA 
AGT GT GTCAAGGCCGGAAGCCTGTGGGATCCGAACATCAC TGCT TGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAAC T TCACAACCAC TCCCC T GGGAAACAGATACAT GGCT CT TATC CAACACAGCAC 
TAT CATC GGGT T T TC T CAGGTGT TT GAGCCACAC CAGAAGAAACAAACGCGAGC T TCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAASiGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAA.TATGTTTCCA 
T CACACAAT TT GT TACTT CACT GAAT T TC T T CAAAACCAT TGCAGAAGTGAGGTCATCCT T GAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGAT TCATCTGCACAAATACGTGGTGGT CTACTTTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 

MSLVLLSLAMjCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMOTSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNE DG P SMS VN FT S PGCLD H I MKY KKKC V KAG S L W DPN I T AC KKN E ET VE VNFT TT PLGNRYMAL I Q H 
ST I IGFSQVFEPHQKKQTRAS WI PVTGDSEGATVQLTPY FPTCGSDC IRHKGTWLCPQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAAC ATGA CAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 

i si: GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

|3 ACTCAGCTGAAGAAG 

z v. 
*%' ' 

IP 

O 
II! 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC TGAAGATGAGGAAAAGT T T GGAGGAGCAGCT GACT CAGGGGT GGAGGAGC C TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
^ TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
O GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
Q ACGT GCAT C TGGT T C T GAAT GT C TC T GAGGAGCAGCACT TCGGCC TC T CCC T GTAC T GGAATCAG 

ffi GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 

CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
pi GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
hi TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 

AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
* CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
S3 GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
Iff CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
r| CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 

CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGAC T TAAATAAAGGCAGACGC T GT T T T TC TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL Y S ADDS G FERLVGALASALCQLPLRVAVDLWS RRELSAQGPVAW FHAQRRQTLQEGGVWLL FSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACAT CCT GACG TGGGACAGC GGGC CAGAGGGCACCCCAGACACGGTC TACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GT CAG T GCGGGAGGCC GGT CAGCCACCAAGATGACT GACAGGT T CAGC T CT CTGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TAC CACT TAGAGC T CCAGGT CAACCGCACC TACCAAAT GCACCT TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
O CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
fl TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
ffj AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
l*[ , TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
w CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
111 GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
ffl GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
|sj CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
^ CACCT TAGGCCTAAAGGTCAGCTT CAGAAAGAGCCACCAGC TGGAAGCTGCATGT TAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
O CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 
|f| CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
y GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
|*| AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
fjj ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAAT GCAGGGAAAC T CCCGAGGTCCAGAGCCCCACC T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAAT CT AGC T CGACAGAGCAT GAGGCCCC T GCC TC T T CTGTCAT TG T T CAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACC^GGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAAT GCAGGTCCACCAGGGAGGGAGACACACAAGCCTTT TCTGCAGGCAGGAGTT TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAA.TTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE i6a 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16 s 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACA^ 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGG T GACGAT GTATATGT C TAC TACAAAC TGGCAGGGGGATCCC T TGAACT TT GGGC T GGA 
AGT GT T GAACACAGT T T T GGATATT T T CCAAAAGAT T TGATCAAGGT ACTT CATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGC T TTGAAGGAGGAAGAGATGAT T 
T T AATAGT TATAAT G TAGAAGAGCT T T TAGGATCT T TGGAACT GGAGGACT CTGTACC TGAAGAG 
TCGAAGAAAGC T GAAGAAGT T T C TCAGCACAGAGAGAAAT C TC CTGAGGAG TCT CGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GC T TACAAAGTCCT GAAAACAGAAAT GAGTCAGAGAGGAAGT GGACAGTGCGT TAT T 
CAT TACAGCAAAGGATT TCGT T GGCATCAAAATC TAAGT T TGT T T T AC AAAGAT TGTTT T TAG TA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 

MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFTO 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLBCVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

aAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAZ^AAGAAAGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
AC T T CAAAC T T CAAGCAAAT GAAATGGACAATGCAGAT AAAGT TG T TAT CAACACG T CGGGAGTA 
TGTGTGT T AGAAGCAAT T CC T T T TATT TCT T TCAC CT T T CATAAG T T G T TAT C TAG T CAATGTAA 
TGTATAT T GT AT T GAAAT T TACAGTGT GCAAAAG TAT T T TACC T T TGCATAAGT G T T T GATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAAC T TAT T AC T G T T G T C AAC T G AAT T C ACAC AC AC AC AAAT AT AG TAG C AT AG AAAAAG T T T G T 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
T GT C T AAT T T CAAT T G T GC AAGACAT GT GCC T TAT AAT TAT T T T T AGC T T AAAAT T AAACAGAT T 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAACTAC GAAAT CGTGTGAAAAT GGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPY 

P YLTNQPGDTHQNWVTEAG 1 1 PNL I YWI PT I PLLLL I LVAFGTCC FQMLHKS KGRTKT S PNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites, 

amino acids 86-£9, 255-258 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 2 66-2 69 

N-myristoylation sites, 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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